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(57) Abstract: The present invention provides, inter alia, a glyphosate resistant EPSPS enzyme wherein in comparison with the 
wild type enzyme the EPSPS protein sequence is modified in that a first position is mutated so that the residue at this position is 
He rather that Thr and a second position is mutated so that the residue at this position is Ser rather than Pro, the mutations being 
introduced into EPSPS sequences which comprise the conserved region GNAGTAMRPL in the wild type enzyme such that modified 
sequence reads GNAGIAMRSL. The invention also includes glyphosate resistant plants regenerated from materia] transformed with 
polynucleotides which encode such EPSPS enzymes and a method of selectively controlling weeds in a field comprising such plants 
and glyphosate sensitive weeds, by the application to the field of glyphosate or an agronomically acceptable derivative. 
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HERBICIDE RESISTANT PLANTS 

The present invention relates to recombinant DNA technology, and in particular to 
the production of transgenic plants which exhibit substantial resistance or substantial 
tolerance to herbicides when compared with non transgenic like plants. The invention also 
5 relates, inter alia, to the nucleotide sequences (and expression products thereof), which are 
used in the production of, or are produced by, toe said transgenic plants. 

Plants which are substantially "tolerant" to a herbicide when they are subjected to it 
provide a dose/response curve which is shifted to toe right when compared with that 
provided by similarly subjected non tolerant like plants. Such dose/response curves have 
10 "dose" plotted on toe x-axis and "percentage kill","herbicidal effect" etc. plotted on the y- 
axis. Tolerant plants will typically require at least twice as much herbicide as non tolerant 
like plants in order to produce a given herbicidal effect Plants which are substantially 
"resistant" to the herbicide exhibit few, if any, necrotic, lytic, chlorotic or other lesions when 
subjected to the herbicide at concentrations and rates which are typically employed by the 
15 agricultural community to kill weeds in the field in which crops are to be grown for 
commercial purposes. 

It is particularly preferred that toe plants are substantially resistant or substantially 
tolerant to herbicides (hereinafter "glyphosate") which have 5-enol pyruvyl shikimate 
phosphate synthetase (hereinafter "EPSPS") as their site of action, of which N- 
20 phosphonometoylglycine (and its various salts) is toe pre-eminent example. 

The herbicide may be applied either pre- or post emergence in accordance with usual 
techniques for herbicide application to fields comprising crops which have been rendered 
resistant to toe herbicide. The present invention provides, inter alia, nucleotide sequences 
useful in toe production of such herbicide tolerant or resistant plants. 
25 According to the present invention there is provided a glyphosate resistant EPSPS 

enzyme wherein in comparison with toe wild type EPSPS toe protein sequence is modified in 
that a first position is mutated so that toe residue at this position is lie rather than Thr and a 
second position is mutated so that toe residue at this position is Ser rather than Pro, toe 
mutations being introduced into EPSPS sequences which comprise the conserved region 
30 GNAGTAMRPL in the wild type enzyme such that toe modified sequence reads 
GNAGIAMRSL, wherein toe enzyme does not comprise toe sequence 
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K(A/V)AKRAVWGCGGKFPVE, characterised in that the enzyme is stable and possesses 
at least one of the following sequence motifs (i) to (viii) in which X is any amino acid and Z, 
Zj and Z^are any amino acid other than those specified:- 
(i) ALLMZAPLA, wherein Z is not S or T; 
5 (ii) EIEIZDKL, wherein Z is not V; 

(iii) FG(V/I)(K/S)ZEH, wherein Z is not V; 

(iv) AL(K/R)ZLGL, wherein Z is not R; 

(v) GLXVEZ^XZjXXXACIAOV, wherein Z, is not T and/or Zj is not E; 

(vi) ITPPZ,K(L/V0(K/N)Z 2> wherein Z, is not K and/or Zj is not T; 
10 (vii) TIZ(D/N)PGCT, wherein Z is not N or L; 

(viii) (D/N)YFXVXXZXX(K/R)H, wherein Z is not R. 

A preferred embodiment of the enzyme comprises at least two of the motifs (i), (ii), 
(iii), (v) and (vi). In specific embodiments of the present inventive enzyme in motif (i) Z is 
A; in motif (ii) Z is I; in motif (iii) Z is A; in motif (iv) Z is K, T or A; in motif (v) Z 2 is R or 
15 A or less preferably D or E and 2^ is preferably V or A or less preferably T; in sequence 

motif (vi) Z l is E or A and Zj is P, I or V; in motif (vii) Z is R or K; and in motif (viii) Z is T 
or S or less preferably Q. Particularly preferred embodiments of the said enzyme comprise 
one or more sequences selected from the group consiting of: 
(i) (T/V)VEGCGG(I^/Q)FP(V/A/r)(S/G) 
20 (ii) ([AOWGCGG(T/L/Q)FP(V/A/T)E; 

(iii) (IAOWGCGG0WVQ)FP(V/A^(S/G); 

(iv) (W)WGCGGKFP(V/AnO(S/G); 

(v) (I/V)VEGCGGKFP(V /A/T)E; 

(vi) (IAOVEGCGG(I^L/Q)FP(V/A/r)E; 
25 (vii) (W)VEGCGGKFP(V/A^(S/G). 

The present invention also comprises an isolated polynucleotide comprising a region 
which encodes the present inventive enzyme, particular such polynucleotides comprising , 
for example, a sequence depicted in SEQ ED Nos. 4 or 5, or a sequence obtained by 
hybridising an intron located in the SEQ ID 4 or 5 sequences with polynucleotides comprised 
30 in plant genomic libraries. The invention also includes an isolated polynucleotide 
comprising a region encoding a chloroplast transit peptide and a glyphosate resistant 
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dicotyledenous 5-enolpymvylshikimate phosphate synthase (EPSPS) 3' of the peptide, the 
said region being under expression control of a plant operable promoter, with the provisos 
that the said promoter is not heterologous with respect to the said region, and the chloroplast 
transit peptide is not heterologous with respect to the said synthase. 
5 By "heterologous" is meant from a different source, and correspondingly "non- 

heterologous" means derived from the same source - but at a gene rather than organism or 
tissue level. For example the CaMV35S promoter is clearly heterologous with respect to a 
petunia EPSPS coding sequence insofar as the promoter is derived from a virus and the 
sequence - the expression of which it controls - from a plant. The term "heterologous" 

10 according to the present invention has a still narrower meaning, however. For example 
"heterologous" as it relates to the present invention means that the petunia EPSPS coding 
sequence is "heterologous" with respect to, for example, a promoter also derived from 
petunia - other than that which controls expression of the EPSPS gene. In this sense the 
petunia promoter derived from the petunia EPSPS gene then used to control expression of an 

15 EPSPS coding sequence likewise-derived from petunia is "non-heterologous" with respect to 
the said coding sequence. !! Non-heterologous" does not mean, however, that the promoter 
and coding sequence must necessarily have been obtained from one and the same (original or 
progenitor) polynucleotide. Likewise with respect to transit peptides. For example, a rubisco 
chloroplast transit peptide derived from sunflower is "heterologous" with respect to the 

20 coding sequence of an EPSPS gene likewise derived from sunflower (the same plant, tissue 
or cell). A rubisco transit peptide encoding sequence derived from sunflower is "non- 
heterologous" with respect to a rubisco enzyme encoding-sequence also derived from 
sunflower even if the origins of both sequences are different polynucleotides which may have 
been present in different cells, tissues or sunflower plants. 

25 Most particularly preferred forms of the said polynucleotide include a construct which 
comprises the following components in the 5 1 to 3' direction of transcription: either (1):- 
(i) At least one transcriptional enhancer being that enhancing region which is 
upstream from the transcriptional start of the sequence from which the enhancer is 
obtained and which enhancer per se does not function as a promoter either in the 

30 sequence in which it is endogenously comprised or when present heterologously as 

part of a construct; 



BNSOOCID: <WO 0226995A1J_> 



WO 02/26995 PCT/GBO 1/04 131 



(ii) The promoter from the soybean EPSPS gene; 

(iii) The soybean genomic sequence which encodes the soybean EPSPS 
chloroplast transit peptide; 

(iv) The genomic sequence which encodes the soybean EPSPS; 
5 (v) A transcriptional terminator; 

wherein the soybean EPSPS coding sequence is modified in comparison with a wild type 
sequence in that a first position is mutated so that the residue at this position is He rather than 
Thr and a second position is mutated so that the residue at this position is Ser rather than Pro, 
the mutations being introduced into EPSPS sequences which comprise the conserved region 
10 GNAGTAMRPLTAAV in the wild type enzyme such that modified sequence reads 
GNAGIAMRSLTAAV; or (2) 

(i) At least one transcriptional enhancer being that enhancing region which is 
upstream from the transcriptional start of the sequence from which the enhancer is 
obtained and which enhancer per se does not function as a promoter either in the 

15 sequence in which it is endogenously comprised or when present heterologously as 

part of a construct; 

(ii) The promoter from the Brasicca napns EPSPS gene; 

(iii) The Brasicca napus genomic sequence which encodes the Brasicca napus 
EPSPS chloroplast transit peptide; 

20 (iv) The genomic sequence which encodes the Brasicca napus EPSPS; 

(v) A transcriptional terminator; 

wherein the Brasicca napus EPSPS coding sequence is modified in comparison with a wild 

type sequence in that a first position is mutated so that the residue at this position is lie rather 

than Thr and a second position is mutated so that the residue at this position is Ser rather than 
25 Pro, the mutations being introduced into EPSPS sequences which comprise the conserved 

region GNAGTAMRPLTAAV in the wild type enzyme such that the modified sequence 

reads GNAGIAMRSLTAAV. 

The present inventive polynucleotide, particularly the two disclosed in the 

immediately preceding paragraphs, may further comprise a sequence which encodes a 
30 chloroplast transit peptide/phosphoenolpyruvate synthase (CTP/PPS) or a chloroplast transit 

peptide/pyruvate orthophosphate di-kinase (CTP/PPDK), which sequence is under 
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expression control of a plant operable promoter. It is particularly preferred that the PPDK 
protein sequence is directed to the chloroplast in planta via its autologous transit peptide, and 
accordingly the polynucleotide of the immediately preceding sentence encodes such a non- 
heterologous combination. The PPDK encoding sequence can be derived from either a 

5 monocot or a dicot and the protein thus encoded is optionally not cold-labile. By "not cold 
labile" is meant that the enzyme retains at least 50% of its activity when incubated at 0 
degrees Celsius for 5 minutes under experimental conditions similar to those referred to in 
Usami et al (1995) Plant Mol. Biol, 27, 969-980. Such a 'cold stable' enzyme is at least 
two-fold more stable than the like enzyme isolated from R bidentis. The PPS or PPDK 

10 enzymes provide for elevated levels of PEP in the chloroplast which is the site of action of 
glyphosate and which contains the EPSPS enzyme. 

The enhancing region of the polynucleotide preferably constitutes a sequence the 3' 
end of which is at least 40 nucleotides upstream of the closest transcriptional start of the 
sequence from which the enhancer is obtained. In a further embodiment of the 

15 polynucleotide, the enhancing region constitutes a region the 3' end of which is at least 60 
nucleotides upstream of the said closest start, and in a still further embodiment of the 
polynucleotide the said enhancing region constitutes a sequence the 3' of which is at least 10 
nucleotides upstream from the first nucleotide of the TATA consensus of the sequence from 
which the enhancer is obtained. 

20 The polynucleotide according to the invention may comprise two or more 

transcriptional enhancers, which in a particular embodiment of the polynucleotide may be 
tandemly present. 

In the present inventive polynucleotide the 3' end of the enhancer, or first enhancer if 
there is more than one present, may be between about 100 to about 1000 nucleotides 

25 upstream of the codon corresponding to the translational start of the EPSPS transit peptide or 
the first nucleotide of an intron or other untranslated leader sequence in the 5' untranslated 
region in the case that the said region contains an intron or other untranslated sequence. In a 
more preferred embodiment of the polynucleotide, the 3' end of the enhancer, or first 
enhancer, is between about 150 to about 1000 nucleotides upstream of the codon 

30 corresponding to the translational start of the EPSPS transit peptide or the first nucleotide of 
in the 5* untranslated region, and in a still more preferred embodiment the 3' end of the 
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inventive polynucleotide, the 3' end of the enhancer, or first 
ocated between about 300 to about 380 nucleotides upstream of the codon 
translational start of the EPSPS transit peptide or the first nucleotide of 
intranslated leader sequence in the 5' untranslated region, and in a 

of this alternative polynucleotide the 3' end of the enhancer, or first 
between about 320 to about 350 nucleotides upstream of the codon 
translational start of the EPSPS transit peptide, or the first nucleotide of 
leader sequence in the 5 1 untranslated region, 
according to the invention, the region upstream of the promoter 
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transcriptional start of a promoter selected from the group consisting of 
rolDFd, S-adenosyl homocysteinase, histone, tubulin, polyubiquitin and 
and the CaMV35S and FMV35S genes. 
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Alternatively, the polynucleotide may comprise in the 5' to 3' direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence which is 
upstream from the transcriptional start of either the CaMV35S or FMV35S promoters, and a 
' second enhancer comprising a transcriptional enhancing region derived from a sequence 
5 which is upstream from the transcriptional start of a histone gene. 

Alternatively, the polynucleotide may comprise in the 5' to 3' direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence which is 
upstream from the transcriptional start of either the CaMV35S or FMV35S promoters, and a 
second enhancer comprising a transcriptional enhancing region derived from a sequence 
10 which is upstream from the transcriptional start of a tubulin gene. 

Alternatively, the polynucleotide may comprise in the 5 1 to 3' direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence which is 
upstream from the transcriptional start of the FMV35S promoter and a second enhancer 
comprising a transcriptional enhancing region derived from a sequence which is upstream 
15 from the transcriptional start of the CaMV35S promoter. 

Alternatively, the polynucleotide may comprise in the 5' to 3 1 direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence which is 
upstream from the transcriptional start of the CaMV35S promoter and a second enhancer 
comprising a transcriptional enhancing region derived from a sequence which is upstream 
20 from the transcriptional start of the FMV35S promoter. 

Whatever the identity and juxtaposition of the various enhancers present in the 
polynucleotide, the nucleotides 5' of the codon which constitutes the translational start of the 
EPSPS chloroplast transit peptide may be Kozack preferred. What is meant by this is well 
known to the skilled man and will be further apparent from the examples below. 
25 Particularly preferred embodiments of the present inventive polynucleotide comprise 

up stream of the sequence encoding the EPSPS chloroplast transit peptide a sequence 
encoding a non-translated 5' leader sequence derived from a relatively highly expressed gene, 
such as that of the glucanase, chalcbne synthase and Rubisco genes. 

The polynucleotide of the invention may also comprise a virally derived translational 
30 enhancer located within the non translated region 5* of the genomic sequence which encodes 
the EPSPS chloroplast transit peptide. The man skilled in the art is aware of the identity of 
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such suitable translational enhancers - such as the Omega and Omega prime sequences 
derived from TMV and that derived from the tobacco etch virus, and how such translational 
enhancers can be introduced into the polynucleotide so as to provide for the desired result of 
increased protein expression. 
5 The polynucleotide according to the invention may further comprise regions encoding 

proteins capable of conferring upon plant material containing it at least one of the following 
agronomically desirable traits: resistance to insects, fungi, viruses, bacteria, nematodes, 
stress, dessication, and herbicides. Whilst such a polynucleotide contemplates the herbicide 
resistance conferring gene being other than an EPSPS, such as glyphosate oxido-reductase 

10 (GOX) for example, the herbicide may be other than glyphosate in which case the resistance 
conferring genes may be selected from the group encoding the following proteins: 
phosphinothricin acetyl transferase (PAT), hydroxyphenyl pyruvate dioxygenase (HPPD), 
glutathione S transferase (GST), cytochrome P450, Acetyl-COA carboxylase (ACCase), 
Acetolactate synthase (ALS), protoporphyrinogen oxidase (PPO), dihydropteroate synthase, 

15 polyamine transport proteins, superoxide dismutase (SOD), bromoxynil nitrilase, phytoene 
desaturase (PDS), the product of the tfdA gene obtainable from Alcaligenes eutrophus, and 
known mutagenised or otherwise modified variants of the said proteins. In the case that the 
polynucleotide provides for multiple herbicide resistance such herbicides may be selected 
from the group consisting of a dinitroaniline herbicide, triazolo-pyrimidines, a uracil, a 

20 phenylurea, a triketone, an isoxazole, an acetanilide, an oxadiazole, a triazinone, a 

sulfonanilide, an amide, an anilide, an isoxaflutole, a flurochloridone, a norflurazon, and a 
triazolinone type herbicide and the post-emergence herbicide is selected from the group 
consisting of glyphosate and salts thereof, glufosinate, asulam, bentazon, bialaphos, bromacil, 
sethoxydim or another cyclohexanedione, dicamba, fosamine, flupoxam, phenoxy propionate, 

25 quizalofop or another aryloxy-phenoxypropanoate, picloram, fluormetron, atrazine or another 
triazine, metribuzin, chlorimuron, chlorsulfuron, flumetsulam, halosulfuron, sulfometron, 
imazaquin, imazethapyr, isoxaben, imazamox, metosulam, pyrithrobac, rimsulfuron, 
bensulfuron, nicosulfuron, fomesafen, fluroglycofen, KIH9201, ET751, carfentrazone, 
mesotrione, sulcotrione, paraquat, diquat, bromoxynil and fenoxaprop. 

30 A particular embodiment of the polynucleotide according to the invention provides 

for resistance to both glyphosate and a protoporphyrinogen oxidase (PPGO) inhibitor, in 



BNsnoan: <wo 



0?26995A1 I > 



WO 02/26995 



PCT/GB01/04131 



particular butafenicil, for which the corresponding resistance protein is a PPGO or an 
inhibitor resistant variant thereof. A particularly preferred construct encoding such a 
herbicide resistance conferring combination comprises the EPSPS encoding sequence of the 
invention in combination with a sequence which encodes a PPDK or PPS in combination 

5 with a PPGO inhibitor resistant PPGO enzyme encoding sequence - such sequences being 
directed to the chloroplast - if desirable - by their autologous transit peptides and being 
under the expression control of non-heterologous promoters if the protein encoding 
sequences are of plant as opposed to bacterial origin. 

In the case that the polynucleotide comprises sequences encoding insecticidal 

10 proteins, these proteins may be selected from the group consisting of crystal toxins derived 
from Bt, including secreted Bt toxins; protease inhibitors, lectins, 

Xenhorabdus/Photorhabdus toxins. The fungus resistance conferring genes may be selected 
from the group consisting of those encoding known AFPs, defensins, chitinases, glucanases, 
and Avr-Cf9. Particularly preferred insecticidal proteins are crylAc, crylAb, cry3A, Vip 

15 lA,Vip IB, Vip3A, Vip3B, cysteine protease inhibitors, and snowdrop lectin. In the case 
that the polynucleotide comprises bacterial resistance conferring genes these may be selected 
from the group consisting of those encoding cecropins and techyplesin and analogues 
thereof Virus resistance conferring genes may be selected from the group consisting of 
those encoding virus coat proteins, movement proteins, viral replicases, and anti-sense and 

20 ribozyme sequences which are known to provide for virus resistance; whereas the stress, salt, 
and drought resistance conferring genes may be selected from those that encode Glutathione- 
s-transferase and peroxidase, the sequence which constitutes the known CBF1 regulatory 
sequence and genes which are known to provide for accumulation of trehalose. 

The polynucleotide according to the invention may be modified to enhance 

25 expression of the protein encoding sequences comprised by it, in that mRNA instability 
motifs and/or fortuitous splice regions may be removed, or crop preferred codons may be 
used so that expression of the thus modified polynucleotide in a plant yields substantially 
similar protein having a substantially similar activity/function to that obtained by expression 
of the protein encoding regions of the unmodified polynucleotide in the organism in which 

30 such regions of the unmodified polynucleotide are endogenous. The degree of identity 
between the modified polynucleotide and a polynucleotide endogenously contained within 
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the said plant and encoding substantially the same protein may be such as to prevent co- 
suppression between the modified and endogenous sequences. In this case the degree of 
identity between the sequences should preferably be less than about 70%. 

The invention still further includes a biological or transformation vector comprising 
the present inventive polynucleotide. Accordingly, by "vector" is meant, inter alia, one of the 
following: a plasmid, virus, cosmid or a bacterium transformed or transfected so as to contain 
the polynucleotide. 

The invention still further includes plant material which has been transformed with 
the said polynucleotide or vector, as well as such transformed plant material which has been, 
or is, further transformed with a polynucleotide comprising regions encoding proteins 
capable of conferring upon plant material containing it at least one of the following 
agronomically desirable traits: resistance to insects, fungi, viruses, bacteria, nematodes, 
stress, dessication, and herbicides. 

The invention still further includes morphologically normal fertile whole plants 
which result from the crossing of plants which have been regenerated from material which 
has been transformed with the polynucleotide of the invention and plants ,which result from 
regeneration of material transformed with a polyncleotide which comprises a sequence which 
encodes a protein, such as a phosphoenolpyruvate synthase (PPS) or pyruvate 
orthophosphate di-kinase (PPDK) which proteins are capable of providing for elevated levels 
of phosphoenolpyruvate in the chloioplast. Such proteins are directed to the chloroplast by a 
suitable transit peptide - in the case of the PPDK enzyme preferably by the autologous 
peptide. 

The invention still further includes morphologically normal fertile whole plants 
which contain the present inventive polynucleotide and which result from the crossing of 
plants which have been regenerated from material transformed with the present inventive 
polynucleotide or vector, and plants which have been transformed with a polynucleotide 
comprising regions encoding a CTP/PPS or CTP/PPDK, and/or regions encoding proteins 
capable of conferring upon plant material containing it at least one of the following 
agronomically desirable traits: resistance to insects, fungi, viruses, bacteria, nematodes, 
stress, dessication, and herbicides. The invention also includes progeny of the resultant 
plants, their seeds and parts. 
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Plants of the invention may be selected from the group consisting of field crops, 
fruits and vegetables such as canola, sunflower, tobacco, sugar beet, cotton, maize, wheat, 
barley, rice, sorghum, mangel worzels, tomato, mango, peach, apple, pear, strawberry, 
banana, melon, potato, carrot, lettuce, cabbage, onion, soya spp, sugar cane, pea, field beans, 
5 poplar, grape, citrus, alfalfa, rye, oats, turf and forage grasses, flax and oilseed rape, and nut 
producing plants insofar as they are not already specifically mentioned, their progeny, seeds 
and parts. 

Particularly preferred such plants include soybean, canola, brassica, cotton, sugar 
beet, sunflower, peas, potatoes and mangel worzels. 

10 The invention still further comprises a method of selectively controlling weeds in a 

field, the field comprising weeds and plants of the invention or the herbicide resistant 
progeny thereof,, the method comprising application to the field of a glyphosate type 
herbicide in an amount sufficient to control the weeds without substantially affecting the 
plants. According to this method, one or more of a herbicide, insecticide, fungicide, 

15 nematicide, bactericide and an anti-viral may be applied to the field (and thus the plants 
contained within it) either before or after application of the glyphosate herbicide. 

The invention still further provides a method of producing plants which are 
substantially tolerant or substantially resistant to glyphosate herbicide, comprising the steps 
of: 

20 (i) transforming plant material with the polynucleotide or vector of the invention; 

(ii) selecting the thus transformed material; and 

(iii) regenerating the thus selected material into morphologically normal fertile 
whole plants. 

The transformation may involve the introduction of the polynucleotide into the 
25 material by any known means, but in particular by: (0 Holistic bombardment of the material 
with particles coated with the polynucleotide; Qi) by impalement of the material on silicon 
carbide fibres which are coated with a solution comprising the polynucleotide; or (iii) by 
introduction of the polynucleotide or vector into Agrobacterium and co-cultivation of the 
thus transformed Agrobacterium with plant material which is thereby transformed and is 
30 subsequently regenerated. Plant transformation, selection and regeneration techniques, 
which may require routine modification in respect of a particular plant species, are well 
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known to the skilled man. The thus transformed plant material may be selected by its 
resistance to glyphosate. 

The invention still further provides the use of the present inventive polynucleotide or 
vector in the production of plant tissues and/or morphologically normal fertile whole plants 
5 which are substantially tolerant or substantially resistant to glyphosate herbicide. 

The invention still further includes a method of selecting biological material 
transformed so as to express a gene of interest, wherein the transformed material comprises 
the polynucleotide or vector of the invention, and wherein the selection comprises exposing 
the transformed material to glyphosate or a salt thereof, and selecting surviving material. 
10 The said material may be of plant origin, and may in particular be derived from a dicot 
selected from the group consisting of soybean, sugar beet, cotton and the Brassicas. 

The invention still further includes a method for regenerating a fertile transformed 
plant to contain foreign DNA comprising the steps of: 

(a) producing regenerable tissue from said plant to be transformed; 
1 5 (b) transforming said regenerable tissue with said foreign DNA, wherein said 

foreign DNA comprises a selectable DNA sequence, wherein said sequence functions 

in a regenerable tissue as a selection device; 

(c) between about one day to about 60 days after step (b), placing said 
regenerable tissue from step (b) in a medium capable of producing shoots from said 

20 tissue, wherein said medium may further contain a compound used to select 

regenerable tissue containing said selectable DNA sequence to allow identification or 
selection of the transformed regenerated tissue; 

(d) after at least one shoot has formed from the selected tissue of step (c) 
transferring said shoot to a second medium capable of producing roots from said 

25 shoot to produce a plantlet, wherein the second medium optionally contains the said 

compound; and 

(e) growing said plantlet into a fertile transgenic plant wherein the foreign DNA 
is transmitted to progeny plants in Mendelian fashion, characterised in that the foreign DNA 
is, or the selectable DNA sequence comprised by the foreign DNA comprises, the 

30 polynucleotide according to the invention, and the said compound is glyphosate or a salt 
thereof. The plant may be a dicot as indicated above - more preferably selected from 
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soybean, canola, brassica, cotton, sugar beet, sunflower, peas, potatoes and mangel worzel, 
and the said regenerable tissue may consist of embryogenic calli, somatic embryos, immature 
embryos etc. 

The present invention also includes a diagnostic kit comprising means for detecting 
the present inventive enzyme or polynucleotide or enzymes encoded by it, and therefore 
suitable for identifying tissues or samples which contain these. The polynucleotides can be 
detected by PCR amplification as is known to the skilled man - based on primers which he 
can easily derive from the enzyme encoding sequences which are disclosed in this 
application. The enzymes per se can be detected by, for example, the use of antibodies 
which have been raised against them for diagnosticaUy distinguishing the antigenic regions 
which they contain. 

The present invention will be further apparent from the following description taken in 
conjunction with the associated drawings and sequence listings. 
Of the Sequences: 
15 SEQ ID No.l depicts Brassicia napus genomic DNA. 
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18 
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49 ECPPS1 (PCR primer) 

50 ECPPS2 (PCR primer) 

5 1 -65 Motifs present in the stable EPSPS proteins of the invention 

5 

EXAMPLES 

Methods useful for cloning, expressing and characterising the stable glyphosate- 
resistant EPSP synthases of the current invention are described herein 

Stable glyphosate- resistant EPSPSs of the current invention are characterised as 

10 being stable (retaining > ~ 80% of their catalytic activity) after being incubated in a suitable 
buffer at 37 C for at least 4 hours. As exemplified below, enzyme stability experiments can 
suitably be carried out using substantially purified, part purified or crude preparations of 
enzyme extracts (for example the latter are readily obtained as extracts of transgenic plants 
engineered to express the glyphosate-resistant EPSPS or, similarly, extracts of microbes 

15 engineered to express a transgene encoding the glyphosate-resistant EPSPS). Ideally, 
glyphosate-resistant enzymes are tested for stability both in a purified form and then as 
purified protein spiked back into a crude plant extract so as to test them under more realistic 
in planta -like conditions. In such extracts^ the enzyme activity due to glyphosate-resistant 
EPSPS can easily be discriminated from background susceptible endogenous EPSPS by 

20 testing activity over a range of glyphosate concentrations. At a discriminating concentration 
of glyphosate (e.g 0.1 mM glyphosate in the presence of 0.1 mM PEP) virtually all of the 
measured activity will originate from the resistant enzyme 

Example, 1 Method for identification of stable glyphosate-resistant EPSP synthases 
In order to test the stability of a glyphosate-resistant EPSPS, a test solution of the 
25 enzyme (prepared as described below in buffer at pH ~ 7.0-7.5 and in purified or part 
purified form) is heated to 37 C for 4-5h. The glyphosate-resistant activity of the heated 
enzyme is then compared with the activity of a) a sample similarly prepared but kept at ice 
temperature and b) a similar sample but freshly prepared. 

Stable glyphosate resistant enzymes of the current invention are characterised by the 
30 fact that, following incubation for 5 h at 37 C, they lose litde, if any, of their original 

glyphosate-resistant catalytic activity (< 20% , preferably < 15% and more preferably < 10%) 
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relative to suitable controls when assayed as described below. Preferably the stability 
experiment is carried out using substantially pure enzyme although a very similar result will 
normally be obtained using crude preparations, or when pure protein is spiked back into 
crude extract. Methods suitably used for assay and preparation of enzyme extracts are 
5 further detailed below. The stability of the glyphosate-resistant EPSPS can suitably be 
assessed in a variety of other, essentially equivalent ways (for example by heating the 
enzyme at a slightly higher temperature for a shorter time or a lower temperature for a longer 
period... e.g at 25 C for 3d). 

Example 2. Method for assaying EPSPS activity and determination of kinetic constants. 

10 Assays are carried out generally according to the radiochemical method of Padgette et al 

1987 (Archives of Biochemistry and Biophysics, 258(2) 564-573) with K+ ions as the major 
species of cationic counterion. Assays in a total volume of 50|il, in 50mM Hepes(KOH) pH 
7.0 at 25°C, contain purified enzyme or plant extract (see below) diluted appropriately in 
Hepes pH 7.0 containing 10% glycerol, and 5mM DTT, 14 C PEP either as variable substrate( 

15 for kinetic determinations) or fixed at 100 or 250 pM and shikimate 3 Phosphate (K+ salt) at 
0.75 or 2 mM as indicated. Optionally, for assays of crude plant extracts, assays also contain 
5 mM KF and/or 0.1 mM ammonium molybdate. Assays are started with the addition of 14 C 
phosphoenolpyruvate (cyclohexylammonium+ salt) and stopped after 2-10 minutes (2 m 
minutes is preferable) with the addition of 50fil of a solution of 1 part 1M acetic acid and 9 

20 parts ethanoL After stopping, 20\i\ is loaded onto a synchropak AX100 ( 25cm x 4.6mm ) 
column and chromatographed using isocratic elution with a 0.28M potassium phosphate pH 
6.5 mobile phase flowing at 0.5 ml/min over 35 minutes. Under these conditions the 
retention times for PEP and EPSP are, depending on the individual columns, - 19 and 25 
minutes respectively. A CP 525TR scintillation counter is connected to the end of the AX 

25 100 column. It is fitted with a 0.5ml flow cell, and the flow rate of scintillant ( Ultima Flo 
AP ) is set at Iml/min. Relative peak areas' of PEP and EPSP are integrated to determine the 
percentage conversion of labelled PEP to EPSP. Apparent and Vmax values are 
determined by least squares fit to a hyperbola with simple weighting using the Grafit 3 .09b 
from Erithacus Software Ltd. Km values are generally ascertained using 8-9 concentrations 

30 of variable substrate ranging from K^/2 - lOK^ and triplicate points. Except where 
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specifically noted, data points are only included in the analysis where there is < 30% 
conversion of substrate to EPSP. 

Shikimate-3-Pi (S3P) is prepared as follows, To 7mls of 0.3M TAPS pH 8.5 
containing 0.05M Shikimate, 0.0665M ATP ( Na salt ), lOmM KF, 5mM DTT, and 0.05M 
5 MgCl 2 .6H 2 0, 75|il of a 77 unit ( nmol min 1 ) ml" 1 solution of shikimate kinase is added. After 
24hrs at room temperature, the reaction is stopped by brief heating to 95°C. The reaction 
solution is diluted 50 fold in 0.0 1M Tris HC1 pH 9, and chromatographed by anion exchange 
on Dowex 1 X 8 - 400, using a 0 - 0.34M LiCl 2 gradient. The S3P fractions are combined, 
freeze dried, and then redissolved in 7mls distilled H 2 0. 28mls of 0.1M Ba(CH 3 COOH) 2 and 

10 189mls of absolute ethanol are then added. This solution is left to stir overnight at 4°C. The 
resulting precipitate of tri-Barium S3P is collected and washed in 30mls of 67% ethanol. The 
washed precipitate is then dissolved in - 30mls distilled H 2 0. By adding K 2 S0 4 the K + salt of 
S3P is produced as required. Great care is taken to add a minimal excess of sulphate. The 
BaS0 4 precipitate is removed and the supernatant containing the required salt of S3P freeze 

15 dried. Each salt is weighed and analysed by proton NMR. S3P preparations so-prepared are 
> 90% pure according to proton NMR and, according to their weights and integration of 3 IP 
NMR, contain only low residues of potassium sulphate. 

Example 3. Preparation of extracts of plant material suitable for EPSPS assay 

Callus or plantlet material (0.5 -1 .0 g) is ground to a fine frozen powder in a liquid 

20 nitrogen-chilled mortar and pestle. This powder is taken up in an equal volume of a suitable 
chilled extraction buffer (for example, 50 mM Hepes/ KOH buffer at pH 7.5 containing 1 
mM EDTA, 3 mM DTT, 1.7 mM 'pefabloc' (serine protease inhibitor), 1 .5 mM leupeptin, 
1.5 mM pepstatin A, 10% v/v glycerol and 1% polyvinylpyrolidone), resuspended, mixed 
and centrifuged in a chilled centrifuge to bring down debris. The supernatant is exchanged 

25 down a chilled PD10 column of Sephadex G25 into 25 mM Hepes/ KOH buffer at pH 7.5 
containing 1 mM EDTA, 3 mM DTT and 10% v/v glycerol. Protein is estimated by the 
Bradford method standardised using bovine serum albumen. A portion of the extract is 
frozen in liquid nitrogen; a portion is assayed immediately. 

Example 4. Method for assaying EPSPS activities in crude plant materials and 
30 discriminating the proportion of the total which is resistant to glyphosate. 
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EPSPS assays of plant extracts are carried out, as described above, with 0.1 mM 14 C- 
PEP and 0.75 mM shikimate-3-Pi either in the absence or the presence of 0.1 mM N- 
(phosphonomethyl)glycine. Under these assay conditions, the resistant forms of EPSPS of 
the current invention are estimated to be inhibited by < 8.5% whilst the sensitive w/t form is 

5 essentially fully inhibited (> 98%). Thus, the level of activity observed in the presence of 
glyphosate (A) is taken to represent - 92% of the level of resistant enzyme derived from 
expression of the transgene whilst the level of susceptible w/t EPSPS is taken to be the total 
level of EPSPS activity observed in the absence of glyphosate minus the value of A x - 1 .08. 
Because the Vmax of the mutant enzymes of the current invention are estimated to be, for 

10 example, only about a third of the Vmax of the w/t enzyme (and because the Km values for 
PEP of both w/t and mutant forms are estimated to be about 20 |xM or less), the level of 
expression of the mutant enzyme polypeptide relative to the level of expression of the 
endogenous w/t EPSPS is taken, to be about three fold higher than the ratio calculated on the 
basis of the ratio of their relative observed activities. The total level of EPSPS polypeptide 

15 expression (mutant + w/t) is also estimated by Western blotting. 

Example 5. Cloning and expression of w/t cDNA encoding mature Bra ssica napus 
EPSPS in KcolL 

Brassica napus EPSPS cDNA is amplified using RT-PCR from RNA isolated from 
glass-house grown Brassica napus plants using Superscript RT from BRL according to the 

20 recommendation supplied by the manufacturer. PCR is performed using Pfu turbo 
polymerase from Stratagene according to the methods supplied by the manufacturer. 
Suitably designed oligonucleotide primers based on the known nucleotide sequence (EMBL 
accession X51475) are used in the amplification reaction and the reverse transcription steps. 
The PCR product is cloned into pCRBlunt II using Invitrogens Zero Blunt TOPO kit. The 

25 sequence of the insert is confirmed by sequencing and it is verified that the predicted open 
reading frame corresponds to that of the predicted mature chloroplastic Brassica napus 
EPSPS protein with the exception of the presence of an initiating Met. The cloned and 
verified EPSPS sequence is excised using suitable restriction enzymes as known in the art 
and the purified fragment is cloned into pET24a (Novagen) digested similarly. The 

30 recombinant clones are introduced into BL21 (DE3) a codon-optimised RP strain of Kcoli 
supplied by Stratagene. The EPSPS protein is expressed in this strain following addition of 1 
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mM IPTG to the fennenter medium (LB supplemented with lOOug/ml Kanamycin). The 
recombinant protein of the correct predicted mass is identified (i) on the basis of Coomassie 
staining of SDS gels of cell extracts and side by side comparison with Coomassie-stained 
gels of extracts of similar E. coli cells transformed with an empty pET24a vector and ii) by 
5 western analysis using a polyclonal antibody raised to previously-purified plant EPSPS 
protein. 

Example 6. Cloning and expression in E.coli of glyphosate-resistant mutant Brassica 
napus EPSPS 

The Brassica napus EPSPS cDNA in pCRBlunt is used as a template for two further 

10 PCR using primer pairs designed to introduce specific changes such that the resulting 
reaction product encodes a modified EPSPS protein wherein a first position is mutated so 
that the residue at this position is lie rather than Thr and a second position is mutated so that 
the residue at this position is Ser rather than Pro, the mutations being introduced into EPSPS 
sequences which comprise the conserved region GNAGTAMRPLTAAV in the wild type 

15 enzyme such that the modified sequence reads GNAGIAMRSLTAAV. The sequence of 
Brassica napus EPSPS gene is known and methods for designing primers to effect this 
mutagenesis are well-known in the art 

The resultant PCR product is gel purified and cloned into pCRBlunt II using Invitrogens 
Zero Blunt TOPO kit It is confirmed that the DNA sequence of the insert and its predicted 

20 open reading frame correspond to that of the predicted mature chloroplastic Brassica napus 
EPSPS protein (with the exception of the presence of an initiating Met) and also that the 
desired changes (the specific mutation of T to I and P to S at specific positions in the EPSPS 
sequence) are encoded . The thus cloned and verified B. napus EPSPS sequence is excised 
using suitable restriction enzymes and the purified fragment cloned into pET24a (Novagen) 

25 digested similarly. The recombinant clones are introduced into BL21 (DE3), a codon 

optimised RP strain of Kcoli supplied by Stratagene. The EPSPS protein is expressed in this 
strain following addition of 1 mM IPTG to the fennenter medium (LB supplemented with 
lOOug/ml Kanamycin). The recombinant protein of the correct predicted mass is identified i) 
on the basis of Coomassie staining of SDS gels of cell extracts and side by side comparison 

30 with Coomassie-stained gels of extracts of similar E. coli cells transformed with an empty 
pET24a vector and ii) by Western analysis using a polyclonal antibody raised to previously- 
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purified plant EPSPS protein. This mutant form of Brassica napus EPSPS is purified and 
characterised as described herein. 

Example 7» Purification and characterisation of mutant Brassica napus EPSPS 

The mature mutant Brassica napus EPSPS protein is purified at - 4 C as follows. 25 
5 g wet weight of cells are washed in 50 ml of 0.1M Hepes/ KOH buffer at pH 7.5 containing 5 
mM DTT, 2 mM EDTA and 20% v/v glycerol. Following low-speed centrifugation, the cell 
pellet is resuspended in 50 ml of the same buffer but also containing 2 mM of 'Pefabloc' a 
serine protease inhibitor. Cells are evenly suspended using a glass homogenizer and then 
disrupted at 10000 psi using a Constant Systems (Budbrooke Rd, Warwick, U.K.) Basic Z 

10 cell disrupter. The crude extract is centrifuged at ~ 30,000 g for 1 h and the pellet discarded. 
Protamine sulphate (salmine) is added to a final concentration of 0.2% , mixed and the 
solution left to stand for 30 min. Precipitated material is removed by centrifugation for 30 
min at ~ 30,000 g. Aristar grade ammonium sulfate is added to a final concentration of 40% 
of saturation, stirred for 30 min and then centrifuged at ~ 27,000 g for 30 min. The pellet is 

15 resuspended in - 10 ml of the same buffer as used for cell disruption, further ammonium 
sulfate is added to bring the solution to ~ 70% of saturation, the solution is stirred for 30 min 
and centrifuged again to yield a pellet which is resuspended in - 1 5 ml of S200 buffer (10 
mM Hepes/ KOH (pH 7.8) containing 1 mM DTT, 1 mM EDTA and 20% v/v glycerol). 
This is filtered (0.45 micron) loaded and chromatographed down a K26/ 60 column 

20 containing Superdex 200 equilibrated with S200 buffer. EPSPS-containing fractions 
detected on the basis of EPSPS enzyme activity are combined and loaded onto an xkl6 
column containing 20 ml of HP Q-Sepharose equilibrated with S200 buffer. The column is 
washed with S200 buffer and then EPSPS eluted within a linear gradient developed from 
0.0M to 0.2M KC1 in the same buffer. EPSPS elutes within a single peak corresponding to a 

25 salt concentration at or below approximately 0.1 M. EPSPS-containing fractions detected on 
the basis of EPSPS enzyme activity are combined and loaded onto a HiLoad xk26/60 column 
of Superdex 75 equilibrated with Superdex 75 buffer (25 mM Hepes/ KOH (pH 7.5) 
containing 2 mM DTT, 1 mM EDTA and 10% v/v glycerol. EPSPS-containing fractions 
identified on the basis of enzyme activity are combined and loaded onto a 1ml column of 

30 MonoQ equilibrated with the same, Superdex 75 buffer. The column is washed with starting 
buffer and EPSPS eluted as a single peak over the course of a 15 ml linear gradient 



WO 02/26995 



PCT/GB01/04131 



-21- 

.• 

developed between 0.0 and 0.2M KC1. EPSPS is obtained near (>90%) pure at this stage in 
the purification. Optionally, EPSPS is further purified by exchange into Superdex 75 buffer 
containing 1.0 M (Aristar) ammonium sulphate and loading onto a 10 ml column of phenyl 
sepharose equilibrated in the same buffer. EPSPS is eluted as a single peak early during the 

5 course of a linear gradient of declining ammonium sulphate developed between 1 .0 and 0.0 
M ammonium sulphate. 

The purified mutant form of Brassica napus EPSPS, obtained by these or by similar 
methods and assayed as described above in the presence of 2 mM shikiniate-3-Pi, has an 
apparent Vmax of ~ 5 umol/ min/ mg and a Km for PEP of ~ 25 uM. At 40 uM PEP, the 

10 IC50 value for the potassium salt of glyphosate is ~ 0.6 mM. The estimated Ki value for 
potassium glyphosate of the mutant EPSPS is ~ 0.45 mM. 
Example 8. Cloning of genomic DNA encoding Brassica napus EPSPS 

A Brassica napus genomic DNA fragment (SEQ ID No.l), encoding EPSPS is 
known in the art (Gasser and Klee, 1990). PCR primers BnEPSPS5 (SEQ ID No.2) and 

15 BnEPSPS3 (SEQ IDNo.3) are used to amplify the EPSPS gene from genomic DNA isolated 
from Brassica napus var. Westar. 

Pfu polymerase is used to perform the PCR using the following conditions: 
94 °C 5 min 
94 °C lmin 
20 55 °C 30s 
72°C12min 

The resulting 3.8 Kb product is cloned into pCR4Blunt-Topo and sequenced using 
the following primers: M13Forward, M13Reverse (Invitrogen), BnFl (SEQ ID No.4), BnF2 
(SEQ ID No.5), BnF3 (SEQ ID No.6), BnF4 (SEQ ID No.7), BnF5 (SEQ ID No.8), BnF6 

25 (SEQ ID No.9), BnF7 (SEQ ID No. 10), BnF8 (SEQ ID No.ll), BnF9 (SEQ ID No.12), 
BnFlO (SEQ ID No.13), BnFll (SEQ ID No.14) and BnF12 (SEQ ID No.15). 
Example 9. Introduction of mutations into genomic DNA encoding BnEPSPS. 

The desired Thr-Ile and Pro-Ser mutations are introduced into the B. napus EPSPS 
gene by PCR using oligonucleotide primers containing codon changes so as to introduce the 

30 desired amino acid changes. Primers BnXho (SEQ ID No.16) and BnMUTBot (SEQ ID 

No.17) are used to amplify a desired region of EPSPS from Westar genomic DNA. Likewise, 
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primers BnMutTop (SEQ ID No.18) and BnNde (SEQ ID No.19) are used to amplify an 
additional region from Westar genomic DNA. 1 ul of each of the PCR products is mixed and 
overlapping complementary sequences allowed to anneal. The products are then joined by 
PCR using primers BnXho and BnNde. The resulting PCR product is cloned and sequenced 
5 to check for the inclusion of the double mutation. The Xhol/Ndel fragment in the BnEPSPS 
clone is excised and replace with the cloned Xhol/Ndel fragment containing the double 
mutant. 

Example 10, Construct for expression of mutant B.napus EPSPS in plants having the 
Rubisco leader sequence upstream of the gene encoding mutant Brassica napus EPSPS. 

10 The tobacco rubisco leader is introduced into the construct by PCR. Primers 

BnRBSEP (SEQ ID No.20) and Brasrl2 (SEQ ID No.21) are used to amplify a product from 
50 ng wildtype B. napus genomic DNA as template using Pfu-Turbo polymerase. The 
resulting PCR product is cloned into pCR4 blunt TOPO and clones are identified by 
restrictions digest that are orientated such as it may be excised as Notl (from vector) and 

15 Xhol . The rubisco leader is then introduced into the mutated BnEPSPS clone as Notl/Xhol . 
Example 11> Identification of the Brassica napus EPSPS promoter (useful for 
expression of EPSPS in plants) by genome walking 

Increased lengths of upstream or downstream sequence of any plant gene may be 
obtained using the genome walking kit as supplied and directed by Clontech. 

20 The primers depicted in SEQ ID Nos 22 and 23 (Bras rl 1 and Bnapus start rev) are used to 
obtain upstream sequence of the Brasicca napus EPSPS gene using Brasicca napus genomic 
DNA according to the methods described in the protocol accompanying the genome walker 
kit. 

The Bras rl 1 (SEQ ID No. 22) primer is used in conjunction with the API primer 
25 supplied in the genome walker kit and the Bnapus start rev primer is used with the AP2 
primer supplied with the genome walker kit. . Products are cloned into pCR- TOPO using a 
TOPO TA Cloning kit from Invitrogen according to the manufacturers instructions. PCR 
using Ml 3 forward and reverse primers is performed using Ready to Go Beads from 
Pharmacia to ascertain which clones contain inserts of the correct size. A number of these 
30 clones are used to make DNA preps and these DNAs are then used to determine the 
nucleotide sequence of the inserts. 
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A second set of primers (SEQ ID Nos.24 and 25 respectively) is used for a further 
walk as shown below: 

Genome walker 1 (SEQ ID No.24) with the API primer 
Genome walker 2 (SEQ ID No.25) with the AP2 primer 
5 Additional genomic digests are performed in addition to those recommended by 

Clontech's protocols. These digests include Alu I, Hae HI, Hinc II and Rsa I. These digested 
DNAs are then treated identically to the other digests performed as described by the 
protocols accompanying the Genome walking kit. This is done to increase the chance of 
getting a correct and long PCR product Again the PCR products are cloned into pCR- 

10 TOPO using a TOPO TA Cloning kit from Invitrogen according to the manufactures 

instructions and sequence determination is performed on random clones obtained from each 
cloned product that contain inserts of the correct size. The sequence of the inserts are 
determined. Approximately Ikb of sequence upstream form the translational start site of the 
Brasicca napus EPSPS is obtained that is contiguous with the published sequence. 

15 Example 12. Constructs for expression of mutant Brassica napus EPSPS in plants 

The 35S enhancer is amplified from vector pMJB 1 (Figure 1) by PCR using primers 
CaMV343 and CaMV46C (SEQ ID Nos.26 and 27 respectively) 

The PCR product is cloned into vector pCR4Blunt-TOPO and sequenced. The 
desired promoter, obtained as described in the previous example is amplified from B. napus 

20 Westar genomic DNA using a 5' primer containing a Sphl site and BnGUSXma (SEQ ID 
No.28) 

The PCR product is digested directly with Sphl and Xmal and ligated into the 
pCR4Blunt-TOPO vector containing the 35S enhancer as Sphl/Xmal (partial digest with 
Sph 1). The CaMV35S enhancer fused to the BnEPSPS promoter is then excised from pCR4- 

25 Blunt using Notl (from vector) and Xma 1 and cloned into the PCR4Blunt vector containing 
the tobacco rubisco leader and mutated B. napus EPSPS gene. The FMV enhancer (SEQ ID 
No.29) or the CaMV35S enhancer (SEQ ID No.30) is synthesised chemically and subcloned 
into vector pCR4 Blunt TOPO. The FMV enhancer is excised from PCR4-Blunt as 
Notl/Sacl and ligated into the CaMV35S enhancer/B. napus promoter / rubisco leader / B. 

30 napus EPSPS as Notl/Sacl. The final construct is shown in Figure 2. The whole expression 
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cassette may be excised using EcoRl and ligated into a suitable vector for plant 
transformation. 

Example 13. Plant Transformation and Regeneration, 

The expression cassette is excised from the pFCBnEPSPS vector using EcoRl and 
5 ligated into, for example, a binary vector pBinl9 at the unique EcoRl restriction site. The 
binary vector, containing the desired expression cassette transformed into Agrobacterium 
tumefaciem strain LB A 4404 using the freeze thaw method of transformation provided by 
Holsters et a/., 1978. Tobacco transformation and whole plant regeneration is performed 
using Nicotiana tabacum var. Samsun according to protocols in Draper et al (Plant Genetic 
10 Transformation, Blackwell Sci. Pub. 1989). Transformation events are selected on MS-media 
containing kanamycin. Alternatively pFCBnEPSPS, or other constructs capable of 
delivering glyphosate resistance in plants by the expression of a glyphosate resistant EPSPS 
gene from soybean may be introduced into plants directly without using Agrobacterium 
mediated techniques as described for Soybean. 

15 Constructs are transformed into regenerable embryogenic soyabean tissues using 

either Holistic type approaches (e.g Santarem ER, Finer, J.J (1999) 'Transformation of 
soyabean {Glycine max (L.) Merrill) using proliferative embryogenic tissue maintained on a 
semi-solid medium' In vitro Cellular and Developmental Biology-Plant 35, 451-455; USP- 
5,503,998, USP 5830728 )or via infection with Agrobacterium (e.g. USP-5,024,944, USP- 

20 5,959,1 79). Regenerable embryogenic soyabean tissues are derived, for example, from the 
cotyledons of immature embryos or other suitable tissues. 

Proliferative embryogenic tissue can, for example, be maintained on a semi-solid 
medium. Such tissue, is, for example obtained in the following way. Immature zygotic 
embryos which are 3- 4 mm long are isolated from pods of, for example, Glycine max (L.) 

25 Merrill, 2-3 weeks after flower formation. Pods can be checked for the presence of embryos 
of the coiTect length and maturity by 'backlighting*. Pods are then sterilized. Immature 
embryos are removed and the axis removed from each. Immature embryos are then plated on 
'D40-Lite' semi-solid (0.2% gelrite) MS salts medium at pH 7.0 containing B5 vitamins, 3% 
sucrose and 40 mg/1 2,4-D for 3-4 weeks. For proliferation of embryos the material is then 

30 transferred to *D20' MS salts medium at pH 5.7 containing B5 vitamins, 3% sucrose, 20 
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mg/1 2,4-D and 0.2% Gelrite. Material with bright green globular proliferative embryos is 
selected and subcultured every 2-3 weeks. 

For bombardment, 20-25 clumps/ plate of tissue are selected (subcultured 4-5 days 
prior to bombardment) and arranged in the centre of the dish containing D20 medium. The 
tissue is dried for' 15 min by uncovering for 15 minutes under a sterile hood. Gold particles 
coated in DNA construct (coated, for example, using methods described in the references 
above) are twice bombarded into the tissue on D20 medium using any one of a large number 
of commercially available guns. By way of further example a PDS1000 particle gun is used. 
Particles may be prepared and coated with DNA in a similar manner to that described by 
Klein et al 1987, Nature, 327, 70-73. Alternatively, for example, 60 mg of gold or tungsten 
particles (~ 1.0 urn) in a microcentrifuge tube are washed repeatedly in HPLC-grade ethanol 
and then, repeatedly, in sterile water. The particles are resuspended in 1 ml of sterile water 
and dispensed into 50 ul aliquots in nucrocentrifuge tubes. Gold particles are stored at 4 C, 
tungsten particles at - 20 C. 3 mg of DNA are added to each aliquot of (defrosted) particles 
and the tubes are vortexed at top speed. Whilst maintaining near continuous vortexing, 50 
ul of 2.5M CaCl 2 and 20 ul of 0.1M spermidine is added. After 10 minutes of further 
vortexing, samples are centrifuged for 5 seconds in an eppendorf microcentrifuge, the 
supernatant is drawn off and the particles washed in successive additions of HPLC-grade 
ethanol. The particles are thoroughly resuspended in 60 ul of ethanol and then dispensed in 
10 ul aliquots onto the surface of each macrocarrier to be used in the PDS1000 particle gun. 
Components of the PDS1000 particle gun are surface sterilised by immersion in 70% ethanol 
and air-drying. Target plates prepared, as described above, with tissue arranged into an ~ 
2.5 cm disc are placed 6 cm from the stopping screen. Suitably chosen rupture discs are then 
used for bombardment 

One week after bombardment, all tissue clumps are transferred onto D20 medium, 
buffered to pH 5.7, containing a suitable selective concentration of glyphosate between 0.05 
and 5 mM. After an additional 3-4 weeks all tissue is transferred to fresh D20 medium 
containing an increased concentration of glyphosate within this concentration range. After a 
further 3-4 weeks, living tissue is selected and subcultured on every 3-4 weeks in similar 
D20 medium containing glyphosate. Alternatively, in the case that some other selectable, 
marker than glyphosate is also present then selections may be made as appropriate (e.g using 
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increasing concentrations of hygromycin). Growing sections are thus maintained and, given 
enough tissue, may be analysed by PCR to confirm that they are transgenic for the desired 
DNA. 

In order to develop and mature embryos, tissue clumps are placed onto M6 medium 

5 which comprises MS salts at pH 5.7 containing B5 vitamins, 6% maltose and 0.2% gelrite.. 
6-9 clumps are placed in a tall dish at 23°C. After 3-4 weeks, embryos elongate and can be 
separated and transferred to another round of incubation on M6 medium. After 4-6 weeks, 
embryos are cream-coloured and ready for desiccation. 9 such cre^m-coloured embryos are 
placed in a dry Petri dish, sealed with parafilm and placed onto a shelf for 2-3 days. 

10 Embryos should be somewhat flaccid and not "crispy-crunchy". 

Dessicated embryos can be germinated by plating onto OMS (growth regulator-free 
MS medium). Following germination which normally occurs within a week plants are 
transferred to larger boxes and, once there is sufficient root and shoot formation, thence to 
soil. To prevent fungal contamination it is advisable to wash OMS from the roots with 

15 distilled water. Plants may be kept and grown under high humidity and, initially, under 24 
hour lighting. Plants may be grown until about 2 feet tall under 24 hour lighting and then 
encouraged to flower and form pods through a shift to a 1 6 hour lighting regime. Seeds are 
collected and progeny grown on, crossed and backcrossed into order to move the transgenes 
into the desired plant background using the normal methods of plant breeding. Plants are 

20 routinely analysed for the presence and expression of transgenes using the normal methods of 
molecular biology including analysis by PCR, Southern, Western, ELISA and enzyme assay 
techniques. 

Example 14, Methods of obtaining genes encoding stable glyphosat e resistant EPSPs 
In one embodiment of the current invention, glyphosate resistant EPSPS genes 

25 suitable for expression in E.coli are constructed, as for example, described herein 

where, in the coding sequence a first position is mutated so that the residue at this position is 
He rather than Thr and a second position is mutated so that the residue at this position is Ser 
rather than Pro, the mutations being introduced into EPSPS sequences which comprise the 
following conserved region GNAGTAMRPL in the wild type enzyme such that the modified 

30 sequence reads GNAGIAMRSL. 
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Stable glyphosate-resistant EPSP synthases of the current invention and genes 
encoding these are then obtained by generating a wide range of variants by a process of 
random or partially random mutagenesis (for example using chemical or UV mutagenesis, by 
using a non-proofreading DNA polymerase (e.g. in a strain such as XL1 red) or by using any 

5 one of a number of 'directed evolution' approaches (e.g as reviewed by Kuchner and Arnold, 
1997 in TO1ECH, 15, 523-530 or described by Stemmer (1994) in PNAS, 91, 10747- 
10751) and selecting genes expressing EPSPSs which are stable and which remain 
glyphosate resistant Suitable means of selection include, for example, expression of 
mutated genes (e.g from a weak constitutive promoter) in an Aro A- strain of E.coli grown at 

10 elevated temperatures (e.g 37-42 C) and selecting those transformants most capable of 

growth in minimal medium (optionally also containing glyphosate). Alternatively, anon-aro 
A host strain might be used (preferably e.g. rec A) provided that the medium contains a 
selective concentration of glyphosate. The skilled man will recognise that many methods of 
mutagenesis and selection could be used within the ambit of the current invention. 

15 Example 15, Stable glyphosate-resistant EPSPSs and genes encoding them 

The stable double mutant form of Brassica napus EPSPS is one example of the 
current invention. The skilled man will recognise that other examples of such EPSPSs of the 
current invention and genes encoding them can be cloned, isolated and characterised using 
methods the same or similar to those described above. 

20 In EPSPSs and the EPSPS gene sequences of the current invention, the coding 

sequence at a first position is mutated so that the residue at this position is De rather than Thr 
and a second position is mutated so that the residue at this position is Ser rather than Pro, the 
mutations being introduced into EPSPS sequences which comprise the following conserved 
region GNAGTAMRPL in the wild type enzyme such that the modified sequence reads 

25 GNAGIAMRSL. 

EPSPS protein sequences of the current invention are further characterised in that 
they DO NOT comprise a region having the sequence K(A/V)AKEAVWGCGGKFPVE 
EPSPS protein sequences of the" current invention are further characterised in that 
they comprise at least one of the amino acid sequence motifs listed below in (i)-> (viii). 

30 (i) ALLMZAPLA whereZisNOT S orT 

(ii) EBEIZDKL where Z is NOT V 
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(iii) FG(V/I)(K/S)ZEH where Z is NOT V 

(iv) AL(K/R)ZLGL where Z is NOT R 

(v) GLXVEZiDXZjXXXACIAOV where Z, is NOT T and/or 2^ is NOT E 

(vi) ITPPZiKCLAOCK/hOZi where Z, is NOT K and/or Z, is NOT T 
5 (vii) HZ(D/N)PGCT where Z is NOT N or L 

(viii) (D/N)YFXVLXZXX(K7R)H where Z is NOT R 

wherein X is any amino acid and Z, Zl, and Z2 are any amino acid other than those 
indicated. 

Furthermore it is preferred, for EPSPS genes and proteins of the current invention, 
10 that the protein sequence comprise at least one of the sequence motifs i), ii) iii), v) and vi), 

preferably, at least two of these. 

Furthermore, in amino acid sequence motif i) Z is preferably A, in sequence motif ii) 

Z is preferably I, in sequence motif iii) Z is preferably A, in sequence motif iv) Z is 

preferably K, T or A, in sequence v) Z x is preferably R or A and less preferably D or E whilst 
15 Z2 is preferably V or A and less preferably T, in sequence motif vi) Z x is preferably E or A 

whilst Zj is preferably P, I or V, in sequence motif vii) Z is preferably R or K and in 

sequence motif viii) Z is preferably T or S and less preferably is Q. 

Furthermore, some of the EPSPS protein sequences of the current invention are 

characterised in that they comprise at least one of the following sequences: 
20 (W)VEGCGG(I/L/Q)FP(V/A/r)(S/G) or 

(IAOWGCGG(I/L/Q)FP(V/AAT)E or 

(W)WGCGG(yL/Q)FP(V/AA)(S/G) or 

(I/V)W GCGGKFP(V/A/T)(S/G)or 

(IAOVEGCGGKFP(V/A/T)E or 
25 (IAOVEGCGGCI/L/COFPCV/AnOEor 

0A0VEGCGGKFP(V/A/T)(S/G) 

Furthermore EPSPS sequences of the current invention are preferably derived from 

dicotyledenous plants 

Example 16. Stable glyphosate-resistant EPSPSs derived from soyabean 
30 Preferred examples of the current invention are the EPSPS protein sequences and the 

DNA sequences which encode these which are derived from soyabean ( see for example 
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EPSPS gene SEQ ID Nos 33 and 34) and which are then optionally modified according to 
Example 15 to yield a stable glyphosate resistant EPSP synthase. 

Particularly preferred examples are EPSP synthase genes derived from soyabean in 
which:- 

5 (a) the coding sequence at a first position is mutated so that the residue at this 

position is lie rather than Thr and a second position is mutated so that the residue at 
this position is Ser rather than Pro, the mutations being introduced into EPSPS 
sequences which comprise the following conserved region GNAGTAMRPL in the 
wild type enzyme such that the modified sequence reads GNAGIAMRSL, 

10 And 

(b) the DNA coding sequence may be further modified in accord with Example 
15 so that me encoded protein comprises at least one and preferably two or more of 
the amino acid sequence motifs ii), iii), v) and viii) . 
Kxamnle 17 Cloning; of a soybean partial lengt h cDNA encoding EPSPS. 
15 Soybean RNA is isolated using TriZol™ reagent as described by Gibco BRL and first- 

strand cDNA synlhesised using Superscript II (Gibco BRL) using protocols provided by the 
manufacturer with the degenerate primer EPSPS 10 reverse (SEQ ID No.35). PCR is 
performed using ready to go PCR beads supplied by AmershamPharmacia™ according to the 
protocol supplied by the manufacturer using the reverse transcribed soybean first-strand 
20 cDNA product as a template and the primers EPSPS 4 forward (SEQ ID No.34) and EPSPS 
10 reverse (SEQ ID No.33). 
The following PCR conditions are used:- 

1. 94°C 3 minutes for 1 cycle 

2. 94°C 45 seconds 
25 50°C 30 seconds 

72°C 1 minute for 25 cycles 

3. 72°C 10 minutes for 1 cycle. 

The product obtained of approximately 1 kb is cloned into pCR2.1 using a TOPO TA 
cloning Kit (Invitrogen) according to the manufacturer's recommendations. Colonies are 
30 picked at random and are selected for further work via plasmid preparation with the QIAprep 
Mini-Prep Kit (Qiagen), followed by digestion with Eco RI to show the size of the insert in 
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these clones. Clones containing the expected lkb insert are progressed. The polynucleotide 
sequence of selected clones is determined using an ABI 377 automated sequencer. Database 
searching is performed using the Blast algorithm (Altschul et aL, 1990) and indicates that 
the majority of sequenced clones contain partial length cDNAs exhibiting high homology 
toward known plant EPSPS sequences. 

Example 18 Isolation of a genomic polynucleotide sequence encoding soybean EPSPS. 

A soybean genomic library constructed in Lambda Fix II is purchased from 
Stratagene and 1,000,000 clones are plated out according to the suppliers protocols. Hybond 
N+ filters are used to make lifts from the library plates and these were processed according 
to the protocols supplied by AmershamPharmacia and Molecular Cloning - a laboratory 
manual (Sambrook et aL, 1989). UV cross linking of the DNA to the membrane is carried 
out using a UV Stratalinker (Stratagene). The soybean EPSPS cDNA probe is prepared by 
digesting the clone containing the soybean EPSPS partial length cDNA sequence with Eco 
RI and purifying the 1 kb product by agarose gel electrophoresis. The probe is labelled with 
32 P dCTP using a Ready-To-Go™ DNA labelling kit (AmershamPharmacia). The filters are 
prehybridised and hybridised in Rapid-Hyb Buffer (Amersham Pharmacia). Washes are 
performed using 0.1 x SSPE, 0.5% SDS at 65 °C. The filters are then wrapped in Saran wrap 
and exposed to film in a cassette containing an intensifying screen as described in Molecular 
Cloning - a laboratory manual (Sambrook et al., 1989). 

Hybridising plaques are picked into SM buffer, replated at lower density and 
rescreened through 2 further rounds of purification. Phage DNA is recovered from plaque 
pure phage stocks using methods well known in the art (Sambrooke et aL 1989). The 
recovered phage DNA is digested with Not 1, and the genomic DNA subcloned into 
pBluescript. The resulting vector is transformed into TOP 10 E.colL The resulting clones are 
sequenced using the Genome Priming System (NEB) using protocols provided by the 
manufacturer. Briefly, sufficient batches of 96 GPS events are selected and DNA prepared 
using a Qiagen™ Biorobot. The DNA was sequenced with the GPS-S and GPS-N primers as 
supplied by NEB using an ABI 377 automated sequencer. Sequence data are analysed using 
Seqman (DNASTAR Inc). The gene encoding EPSPS is identified by homology to known 
EPSPS genes using the Blast algorithm. The sequence of two contigs containing sequences 
showing high homology to known EPSPS are given in SEQ ED No.37 and SEQ ID No.38 
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and are termed GmEPSPSl/14 and GmEPSPS12 respectively. A schematic representation of 
these genes is given in Figure 3 and Figure 4. 

The person skilled in the art will know that there could be other EPSPS genes present 
in soybean and that these may be isolated and the nucleotide sequence detennined using 
similar methodologies as described in this example. 

Example 19 Production of plant transformation vector harbourin g t he s oybean EPSPS 
gene. 

The two enhancer combinations FMV35S90 (SEQ ID No.37 and FMV35S46 (SEQ 
ID No.38) are synthesised chemically and subcloned into vector pTCVlOOl (Figure 5) using 
Hnxdmi Pac 1. The soybean EPSPS terminator, and a small region of the coding sequence, 
are obtained by PCR using the primers GmTermBam (SEQ ID No.39) and GmTermEco 
(SEQ ID No.40) using clone GmEPSPS 1/14 as template. The PCR product is cloned into 
pCR4-Blunt-TOPO, sequenced, and excised using Pstl and EcdRl and ligated into the 
pTCVlOOl vector containing the FMV/CaMV35S enhancers. Finally, the Pacl:BamHl 
DNA fragment containing the majority of 1he soybean EPSPS gene in either GmEPSPS 1/14 
is excised and cloned into the pTCVlOOl vector comprising both the enhancers and 
tenninator. 

Example 20 Introduction of mutation into EP SPS gene. 

The desired Thr to He and Pro to Ser mutations required to increase tolerance to 
glyphosate are inserted into me construct in the following manner. Primer BnMUT2PST 
(SEQ ID No.41), which contains the nucleotide changes that give rise to the desired amino- 
acid mutation, is used in conjunction with primer BnMUTKPN (SEQ ID No.42) in PCR with 
pfu-TURBO polymerase (Stratagene) whh GmEPSPS 1/14 as template. The PCR product, 
now containing the desired mutation, is cloned into pCR4-Blunt-TOPO and sequenced. It is 
then excised as Kpn 1 Pst 1 and used to replace the corresponding fragment in the construct 
in pTCVlOOl containing WT sequence. The final vector is termed pTCVGMEPSPS (Figure 
6). 

Example 21 Introduction of 5' UTL. 

An additional 5' UTL, such as that from the small subunit of ribulose Diphosphate 
carboxylase/oxygenase (Rubisco) or tobacco glucanase can be inserted into the construct by 
PCR In addition the context surrounding the AUG translation initiation codon is optimised 
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to give the consensus Kozak sequence ACCAUGG. With regard the SSU rubisco leader, 
primers GmRubPac (SEQ ID No.43) and GmRubbot (SEQ ID No.44) are used to introduce 
the soybean rubisco 5' UTL into 5' UTL of the soybean EPSPS gene. GmRubTop (SEQ ID 
No.45) and GmRubKpn (SEQ ID No.46) are then used to introduce the soybean rubisco 5' 
5 UTL into the first exon of the soybean EPSPS gene. The two PCR products are then joined, 
by PCR using primers GmRubPac and GmRubKpn and the resulting product cloned into 
pCR4Blunt-TOPO and sequenced. Once the authenticity of the PCR product is confirmed it 
is excised from the PCR4 vector as Pacl:Kpn\ and used to replace the PacVKpnl restriction 
fragment in vector pTCVGMEPSPS. 

10 These examples outline the construction of a DNA sequence comprising two 

enhancers fused to a soybean EPSPS gene containing specific mutations in the coding 
sequence that is capable of producing a glyphosate resistant EPSPS enzyme in plants. The 
person skilled in the art will know that enhancers may be introduced at various distances up 
stream from the promoter and transcriptional start site to act to enhance the rate and/or 

15 amount of transcription of the double mutant EPSPS soybean gene in plant cells. The 
distance may have to be determined empirically for each individual gene. This involves 
making a number of constructs where the enhancers are fused at different distances from the 
transcriptional start of the double mutant EPSPS soybean gene and introducing these 
constructs into plant material and assaying the material for resistance to selecting 

20 concentrations of glyphosate. The person skilled in the art will also know that enhancer 

choices are not restricted to CaMV 35S and FMV enhancers, but may also include enhancers 
from any gene that is highly expressed in plants including inter alia, enhancers from actin 
genes, ubiquitin genes, mannopine, nopaline and octopine synthase genes, genes from 
commelina yellow virus and other plant viruses expressed in plants such as potato virus Y 

25 and X coat proteins. The choice of 5' UTR may not be restricted to soybean small subunit 
rubisco and leader from other genes may be used. 
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Example 22 Plant Transformatio n and Regeneration. 

The expression cassette is excised from the pTCVGMEPSPS vector using Xma 1 and 
ligated into a binary vector P Binl9 at the unique Xma 1 restriction site. The binary vector, 
containing the desired expression cassette transformed into Agrobacterium tumefaciens strain 
5 LBA 4404 using the freeze thaw method of transformation provided by Holsters et al, 1978. 
Tobacco transformation and whole plant regeneration are performed using Nicotiana 
tabacum var. Samusun according to protocols in Draper et al (Plant Genetic Transformation, 
Blackwell Sci. Pub. 1989). Transformation events are selected on MS-media containing 
kanamycin. 

10 Alternatively PTCGMEPSPS, or other constructs capable of delivering glyphosate 

resistance in plants by the expression of a glyphosate resistant EPSPS gene from soybean 
may be introduced into plants directly without using Agrobacterium mediated techniques as 
described for Soybean in a number of references including the following: Physiol. Plant. 
(1990), 79(1), 210-12; USP-5,968,830; WO-0042207; andUSP-5,959,179 and as described 

15 in Example 13. 

Example 23 Analysis of Transgenic Plants 
PCR analysis of transfonnants 

Leaf samples were taken from transformed lines and DNA extracted according to known 
methods. Oligonucleotide primers are designed to specific regions within the transgene to 
20 enable its detection in the plant material tested. 
RNA analysis 

The presence ofmRNA encoding the transgene is detected within the plant using Northern 
Blot hybridisation. Total RNA is extracted from leaf tissue using Tri-reagent and protocols 
provided by the manufacturer (Sigma™). Blots are prepared using existing procedures and 
25 probed using radio-labelled soybean EPSPS cDNA. 
Protein Analysis 

The presence of recombinant protein in the transgenic plant is determined using Western 
blotting procedures with antibodies raised to recombinant soybean EPSPS using standard 
protocols. 
30 Enzyme assays 
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Resistant EPSPS activity is detected in plants using 14 C radiolabeled substrates and detection 
of products by HPLC. 
Herbicide tolerance tests 

Following tissue culture, transgenic plants are transferred to 5 inch pots containing John 

5 Innes potting compost no. 3. The plants are allowed to acclimatise (approx. 2 weeks) in the 

greenhouse and formulated glyphosate applied at various concentrations to the aerial tissue 

using a track spayer. Visual assessment of the transformed plants to identify those that are 

resistant to glyphosate is performed 21 days post application. 

Example 24 Introduction of plastid targeted pyruvate orthophosphate di-kinase 
10 (PPDK) or phosphoenolpyruvate synthase (PPS) constructs. 

The cDNA encoding PPDK from maize (EMBL Accession J03901), containing its 
autologous transit peptide sequence, is amplified by RT-PCR from maize tissue using 
primers ZMPPDK1 (SEQ ID No. 47) and ZMPPDK2 (SEQ ID no. 48). The RT-PCR product 

15 is cloned into the vector pCR2:l TOPO and sequenced to check for authenticity. The maize 
PPDK gene is excised from the pCR2.1 vector, as aNco VKpn 1 fragment and ligated into 
similarly digested pMJBl (Figure 1). The plant expression cassetee is then subcloned into 
the pTCVGMEPSPS cassette (Figure 6) prior to further plant transformation. Other PPDK 
genes, such as that encoding a cold-stable PPDK from Flaveria brownii (EMBL accession 

20 AAQ94645) or form rice can be engineered using a similar strategy. Likewise, the gene 
encoding PPS in K coli (EMBL Accession X59381) is amplified by PCR using oligos 
ECPPS1 (SEQ Id No. 49) and ECPPS2 (SEQ ID No. 50). The PCR product is cloned into 
plant expression cassette containing a plant operable promoter (such as the cauliflower 
mosaic virus 35S promoter), a chloroplast transit signal sequence (such as that from the small 

25 subunit of rubisco) and a terminator (such as NOS). 

Sequence listing 

Brassica napus genomic DNA encoding an EPSPS (SEQ ID No.l) 
gcatgcagatcttaaaggctcttttccagtctcacctaccaaaactataagaaaatccacttgctgt^ 
30 tacttaagacgtggcacattattattgactactaga 

cctctcccccccccactcaccaaactcatgttctttgtaaagccgtcactacaacaacaaaggagacgacagttctatagaaaag 
aaattcaatcaatggcgcaatctogcagaatrtgcca^ 
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10 



20 



acaaateaocttteto^^ 
ctgtaattcgcccggttaaggto^ 
gtetcattaagctacccggatcc^^ 
ctttgctgtgagattttgggaacta^^ 

caaaatttttggaggttataggc^gttacaccattctagtctgcatctttcggtttga^ 



aaca_ _ 

gggaatgcaggaacagccatgcgtccartcaccgrtgcagttacagctgcaggtggcaacgcg^ 

attgtcaagaaattgatcttgtgtttgatgcttttag^ 

aggagatttggttgttggt^gcagc^^^ 
ggccttcccggtggaaaggtgatcttcacatttactctatgaattgtttgcag^ 

tttagtttgttgttalat^^ 

gtgggcattactaagtgaaacattagcctattaacccccaaaattmgaaaaaaatttagtaM 
gaaaaacttttaataaatcgtctacagtccxaaaaatcttagagccgg 
15 attttcaggtgaagctttctggatcgatcagtagtcagtacttgartg(xcto 

gagatcattgataaactgatatctgttccaMgttgaaatgacattgaagttgatgg^ 
ctgggatcgmcmgtcaagggcggtcagaaatacaagtaatgagttctt^ 
ccaaatggcaaaactgattcaggtcgcctggtaatgcttatgtagaaggtgatg^ 
actggtgaaactgttactgtcgaaggttgtggaacaactagcctocaggt^ 

ttatgttaagtgtcattagc^ 
attaattagcaaaatcaaacaaaa^ 

tgaaacagagggagtactaattttataataaaamgattagttcaaagttgtgtataacatgt^ 

tmgtgatgaatcccaaaggg^ 
actgtgactggaccatcaagag^^^ 
25 actctogcxgttgttgctctctttgccgatggtccaaccaccatcagaga^^ 

gattcatggttgcttaaactctatttggtcaatgtagtggctagctggagagt^ 
cttagaaaggtaagmccttttctctcatgctctctcattcgaagttaatcgttgcat 
tacagtggaagaaggttcagattattgtgtgataactccaccagcaaaggtgaaaccggcggagattgata^ 

tggcgatggcgttctcgrtgcagcrtgtgctga^^ 
ccaagtccttgaaagtatcacaaagc^^^ 



30 



aatcagttttgggccttaat 
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aaagggtttaggaagctgcagcgagatgattgtttttggtcgatcatctttgaaaatgtgm^ 

acactaagaaacactttttgattttctattacac^^ 

ttttgcrtttaatggaaaggacacagtt^ 

atgagtgatgagtaatgtg^gaatatgtgatgttaatgtgtttatagtagataagtggacaa^ 
5 caatacgtggacatggactcatgttactaaaattatac^ 

gctcaattgataaattatctgcagttaaacttcaatcaagatgagaaagagatgatattgtga^ 

agcgtttaccttttgtcggagagtaatgaattc 

BnEPSPSS (SEQIDNo.2) 

gcatgcagatcttaaaggctcttttccagtc 
10 BnEPSPS3 (SEQ'ID No.3) 

gaattcattactctccgacaaaaggtaaacgc 

BnFl (SEQlDNo.4) 

gcgcaatctagcagaatctg 

BnF2(SEQIDNo.5) 
15 tecttcttgccgctctatct 

BnF3 (SEQIDNo.6) 

gcgttgaagaagctggggct 

BnF4(SEQIDNo.7) 

acctataggagatttggttg 
20 BnF5(SEQIDNo.8) 

actaagtgaaacattagcct 

BnF6(SEQIDNo.9) 

gttccatatgttgaaatgac 

BnF7(SEQIDNo.lO) 
25 tccactctgaatcatcaaat 

BnF8 (SEQIDNo.ll) 

gtaagaatctaagctcattc 

BnF9 (SEQIDNo.12) 

tcatggttgcttaaactcta 
30 BnFl 0 (SEQ ID No. 1 3) 

cagcttgtgctgatgttcca 
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BnFll (SEQIDNo.14) 

cgatcatctttgaaaatgtg 

BnF12(SEQIDNo.l5) 

ctctctgttttgattccata 
5 BnXho(SEQIDNo.l6) 

cgcatcagcctcgagcttcttcgtg 

BnMUTBot (SEQ ID No.17) 

ggtgagtgaacgcatggctattcclgcat 

BnMutTop (SEQ JD No.18) 
10 atgcaggaatagccatgcgttcactcacc 

BnNde(SEQIDNo.l9) 

catttcaacatatggaacagatatc 

BnRBSEP (SEQ ID No.20) 
cccgggatattggaagttaaaggaaaagagagaaagagaaa^ 

15 c 

Brasrl2(SEQDDNo.21) 

ggacgtggaaacagaagctg 

Brasrll (SEQIDNo.22) 

tctcaaaccgaaagatgcag 
20 Bnapus start rev (SEQ ID No.23) 

ggttgaagcacaatctctgaagcttccatgg 

Genome walker 1 (SEQ ID No.24) 

gatcttgggtcattttaatgggccttctaac 

with the API primer 
25 Genome walker 2 (SEQ ID No.25) 

aaataaacccattaaccactaatgaaaaagggat 

with the AP2 primer 

CaMV343 (SEQ ID No.26) 

gcggagctcgagacttttcaacaaaggg 
30 CaMV46C (SEQ ID No.27) 

atcccgggaagcatgcgaaggatagtgggattgtg 



BNSOOCID; <WO 0226996A1J_> 



WO 02/26995 



PCT/GB01/04131 



-38- 

BnGUSXma (SEQ ID No.28) 
cgcccgggattgaatttbaaagc 
FMV enhancer (SEQ ID No.29) 
tctagagcagctggcttgtggggaccagacaaaaaaggaal^ 
5 gcaaagataaagcagattcctctagtacaa^ 
tgacgaacgcagtgacgaccacaaaagagctc 
CaMV35S enhancer (SEQ ID No.30) 

tgagactmcaacaaagggtaatatccggaaacctcctcggattccattgcccagctatctgtcact^ 
gaaggtggctcctacaaatgccalx^ttgcgataaagga 
10 c^cccacccacgaggagcatcgtggaaaaagaagacgttccaaccacgtcttcaaagcaagtggattgatgtgaMctccac 

taagggatgacgcacaatcccactatccttcgcaagacccttc 

Soyabean genomic DNA encoding an EPSPS (SEQ ID No. 3 1) 

aaaaaataccttttagtaaaataagagcgtrag^ 

gaaactaaggctaaaatcaactcgcctagtcaagctcgtccacaaaaataggtttttgaaagttt^ 
15 aaggatcatttgtaaggtccaacgccttaaaatgate^^ 

tctgatttcctctccaaaggagggtacgtaggagcaaaagccccgctmgtcgaccttaaaaaataaaaa 

acacaatttccacaattctagaaataaggttgttgtctttcgagacaaacgtaagaggtgctaatacctt^ 

cgaacttagaattttratffigaccgg^ 

ttggaaagcgcacccatgagcctcgccctcgctcgcccgcgaagggcacgttgcgacactaggttaataagaagtttagtcca 
20 ctatggacctcttaaggtgattatcaaagtggaagtggtgacctacatgttggcc^taccagctta 

gtatctcagctgaaacatgctatcaatccaaat^ 

atg^caaatggcacaagtgagattttgaatttctgatgttgteaaaatctcaggacat^ 

aatccaaactgacccaaaattctcaaatcacat^ 

gmcgatgattaaagcgttgttgttgatgatgaa^ 
25 cgaagatagagaaaaagatalatagatttttcaaggctcaaaccctaaaatcaccatgagagagaa^ 

cactatgagagagaatgagcagaacagaagcgtgagatagagaac 

tgagagaaagaacaaaggagcctacggtgtgagatgagagaatttgaaattcttaccatttaggtggaattt^ 
attaaaattattttaaaaaatgatg^cattttaaattctttaaaatctcatatccaa^ 
aaattatcttatttaaataccccatccaaacatagcgaaatgttcatgagaaggatcaagtggtttgga^ 
30 cagttcatggaatccttgatagatotttaa^ 

aWggagagtgttgtgtggttgtctcccctgtaggcaaaagcttcgatgtaaaggagttcaatgtccaa 
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attattgaaaatgaatgacacattttatttggttgaaatcaagaaataagcatgtggcaagcaacgggtatttgacaatt 
ggtgaatgragcaaaagaattaatgaactcct^ 

agcagttggaattcttggaatgacaagatatgaaatgaacataccaaccrtcctcttgtttctgm^ 
gcaaagtgcaaggtaatatgctcctcaacatcaattaacaaatgttcatgacatctcttaccagct^ 
5 ccaccgtcaagtttccttaaatgcaacattaatgttgctcatttcaaggagg 

attegteaaagpteartw^^ 
ttgatctgggcccagactattggtttacataatatcattmga^ 

ataagattatcctatttgtgcatgaatgcgaccaactatctcctgtttcagcattataaagtataaacaac^ 

aaagatattggacatacaagctaaaagtgatag^ 
10 caaattttttttgtttatttaaaataaagtggttcattaggtagctcttgaaaaaaaa 

ataatattctattataagtaaagaggaataattM^ 

attttattttgttaatagttgtcattgtgga^ 

tgttactaaatataaatttcatggtatactcaggacggttataacggaaatcaaataaataataaaatag 

agaalaatattaatttattetgtttacactatgtataacgaa 

15 gmcttattttattaccatmgttcacttctcaattaataaaataaaa^ 

tacagattmrtctattaactaattaattaaaaattatccttattttaagga^ 

Mgtacxaaaaagttttaaagcaattattataaa^ 

aaattatttaaaattmgaatgataatataataattttatcctct^ 

aaaracttataacaaaaaaagtaagtaaaaaa 

2Q ttatacattgataatataaattttttac^^ 

aacaatgamgcaattagatgataatataaaattattttacacactacatgtattaaactcaaactttta 

taactcaaaaaaaatgttacttat^^ 

actactctcttagacaccaaacaacccttcccaaactataatctaatccaaaagccatcattcatttt^ 
ttcaccaacttmcactcaattgttttggtgtaagcaattcga(atgtgttagtgto 
25 aacaaccactracaccaccaacccccataaccattcte^ 
tctcatttcagagagagagagagagacttcacagacc^ 

caagtgagcagagtgcacaatcttgctcaaagcactcaaattWggccattcttccaactccaacaaactcaaate 
attgaggccacgtxtttggggggcctraaaatctcgcatcccgatgcataaaaatggaagctttatg 
gaaattccggcgtgtttaaggtttctgcatcggtcgccgccgcagagaagccgtcaacgtcgccg^ 
30 agacttctcgggtaccatcacattgccagggtccaagtctctgtccaatcgaattttgctt^ 
tatttatttgWgtttgttgttgggtgtgggaataggagtttgatgtgtagagtggattttg 
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aagcatcatgtcccatgaaagaaatggacacgaaattaagtggcttatgatgtgaaatgaggatagaaatgtgtgtagggtt^^ 
gtag<^taagcatattcaatatctggattgatttggacg^ 

ctcctaaaaatcaagtgggagtagtatacatatctacagcaaatgtattaggtgaggcatttggcttctctattg^ 

cagt^tgtgaaaatcaatggttgatattccaatacattcatgatgtgttatttaMgtac^ 
5 aagattatttattttatcctctaaagtgactaattgagttgcttactttagagaagttggaccxattaggtga 

ttgaatata(^tctgagtcttgattatccaagtatggttgtatgaacaatgttagctctagaagataaaccctcc^ 

gaatgacatttcaagttccatgtatgtcactttcattctattatttttacaacttttagttactta^ 

trtgttttttmagggaacaactgttgtagac^cttgttgtatagtgaggatattcattacatgctt^^ 

gtgtggaagatgacaaaacaacxaaacaagcaatt^ 
10 aatttattccttggaaatgctggtactgi^tgc^ 

gttcagcatoatctttgaattgltcctcgtataacta^^ 

tcatcctttcttcttaaaaggaacctttaattatcaatgtatttatttaatatttaaat^ 

ctgagaaatgcacrtgagggtgttcctcatgatctacttcaacctctgttattattagatttt^ 

accatcttgatgttwiaaatatttcagctac^ 
15 aagcaacttggtgcagatgttgattgcmcttggcacaaa^ 

gtatggmggatttcatttagaataaggtggagtaacmcctggatcaaaattctaatttaagaagc^^ 

gactaagggtaggtttaggagttgggttttggagagaaatggaagggagagcaattttmcttcttct^ 

ttttttaagtaaaagaatataaagatagattagcataacttaatgttttaatctto 

aatatttgtwtocattccctttcccte^ 
20 ccttttcagagattatccctcacctttattatag(xtttctatttttaaacttcatalaga^ 

gttataccttcccctccttattctctcc 

gaaactgtotggatcagttagcagtcaatacttgactgcmgcttatggcagctcctttagctcttggtgat^ 

aaactgatttctgttccatatgttgaaatgacto^ 

ttggtcratggaggtcaaaagtacaagtaggtttctat^^ 
25 tcctattaggtctcctggcaatgcttttgttga^^ 

cactgttaatggctgtggcacaagcagtttaraggtat^ 

ttcaacccccttc^ctetttcccct^ 

attctaratgattatcctttttagtgaact^ 

atttcattaagagtgataaaattacattgtcaagtaa^^ 
30 cacaattcattagcttagtgacaattgtgatgtagatagccataatcttcxatgcaacttattat^ 

catgaattagagagttaggatgaacaatagtaac^gacacaaagcttatttctagaccacccagtggtgaagacgaagcagcatgagt 
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reaaaac 



10 



20 



25 



ttggttcaaaattgcmcaataaaatgatta^ 
aggctaaccta<»aaagagtgtctacaagcacctttacacttgcaaaaaa 
gagattggagttgattagatttggtgtgtgtgagattctggaaatgagcatgaagttttttgca^ 



atcctagat 

attotaccaagataacaatcttctgtggtgagaa^ 

gtgcagagagaaaacagaaaatgaagaacttgtgtattgaaggaattgaaaagccaattacaattgtgttg^ 

ctgaagtgtataactaactaaacagaatg^^ 
aggcttgatttgrtgccaccgaagaagaacat^^ 
gaagga<*gcggtcgtcgaaggttaagt^^^ 
gaagcggatgcggcaga^ccatgggttcagatc^^ 
actgaactgtaaactgttgtattattgattagctgatctagtagtgatggaagcttgc^ 
tc^gaggtccttcaatggtgattmcaccalggagatgcagcggaagataaatgaagaggtga^^ 
aataaacMggaagaaggagcttcacca(xaagagagtgtcttggataagaagcttagagagg^ 
15 gagagagggggaacaaaaaattgaaggaggaaaagagggaga^ 
ttacaacaagtgttacacatgcttct^^ 

taa^tccttgagaaatttccttgagaagcttwttaagaagattcctagagaagc 

teacctccttgagatgagaagctag^ 
aaaagttcctactacaaagattacte 

agaaggaaaaacctattc^tatttacaaagagagtggacccaaccttagtccatgggctcagaaat^ 



jaaaaacaatgtacaaaggataatag 

aaaaactgaaactgagatatactcattacatagaagaattacatgatgaatctgaaa^ 
ctttcagagttcattttgacaataggtaaattatggacttcaacttagacctcaagggtatgcttgggagtg 

gaaggagggaaattttttaattctagtaaat^^ 
cctgatcttttatttgtt^^ 

ttgtcctgttgtctgtgmctgtmctttctttagtgtdcgcctttcc^ 
tttttttacrtgtactataactttccaaacaaatag^ 

aaaaagcaagcagaggctaaagaagtatatcttattgtgaagagtaatctcaattggttaa^ 

tagaagtgggttgattastttatacgaacaattatttaaatagggttgatggtgtgg^ 

accagcctcagtgcagtcattttaagttccattatttttta^ 
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gttcttgaaaagatgggagctaaggttacatggtcagagaacagtgtcactgtttctggac^ 

tgcgaggcattgatgtcaatatgaacaagatgcc^ 
agagatggtatggmagctgmgttmgctcaagagcttgtmcacatatttgtg^^ 
ttaatgaaattattataattatactaatattgaaattggatcactaaccatgt^ 
5 mcttttatctatgtagtgtggggcat^ 
catgactgtcttttgcagcaaactggt^ 
ctacttataagatttcaacatgatacttgaatgaccacccttactttaaaggt^ 

ttattcttattattttttaa^ 
tacctttttmcataaattatttm^ 
10 aatactamcaatgactgggccttattatWctggmgggttatacaactgtccmga 

atggaaggcactagagtgcagtgcatattgcacaaggtt^^ 
cctatgtcttcatctaatagctaa^ 

ggcaggtggacctgcgcattattcacgtgccaggctcaaagggattttgtatgagg 

tgtggttcacctatcatgtgtcaac^^ 
15 acctaaggaacttgaatgcttcttgttctgagatctttatttttttt 

agaattagggggtcagctmctgttgtaagcgtg^ 

gcaaacttcmgttgctcagattcccatttattcttgmcctctatt^ 

aatctgcacagaactcagaaaggtcttgc^ 

ttcagctaggagcaacagttgaagaaggtcctgattactgtgtgattactccacc^^ 
20 tgaccacagaatggccatggcattctrt^^ 

tcctgactacmgaagtccttgagaggttaa^ 

cacaaagcagtgagtatetggcttgctm^ 

gttgtatcaaagtaatttaattttcagtaccgacttggctctactca^ 

gatgggaccatacatttmgtttatlatttaagtcaaacatatcagataate^ 
25 gattgcatgmgagttcaggacatttattaatgagtgacagaattttaaaaagctcg^ 

taaacactaactgtaacttgagmgataatcagattccccccgttggtaggtaatat 

tcttgacttcgaagtggtgagtgtaattctc^ 

agcaatgaaagcatgaccatggccttggggggccaaatggccatgtaccactttt^ 
cctmgtttcattttttttacaataaagaaagttcatattcttggc^ 
30- cttcattgttcaaatcctttagtc^ctaaggcagggcctatgtaagaattttg 
aatataaattattgcatatctagaggtgaggt^ 
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tggttatttgtcctttaaatatgcgattataggttcaatctttggttaatctatcaaatgcatctc^ 
ctgagagatactttgagttaaa^ 

atgttacaatgtactgaatgttttacaatacagtctggtttgataataaaatatgcac^ 
atgatgatatgcgaattgaaagaaaga^^ 

ggaagggtactggctcatgtgggatattaatttttatcccttacctaaaaagagagaaagagaaaa^ 



acc: 



tgagaaaaaaaatgtgccatgtaaattccagagattgaaatt^ 
agaatttgaagttcttc^^ 

acattagtatatatatatatataaacacagcttatattatattaatgagtgatgcgcgagg^ 

aaatgtaatttaattttttmctttaagtattact(^ttgaata 

tcctgtgattgttotacactagtcc^ 

gatg^gccttcatttcaag^ 

gcacacaaaaactagtagacaataattagtactocttgggttccattaaat^ 

atatacatatgttttatgagmcattgaatttatatttacacaaa 

ataatttcatgcmgatWgattgtacgttttttgtttto 

ccgttcaaaagagagacgacaaggataatactaatactagtagattccacgcattcaaccctagto 

ggtaagaaaaggggaggagaaaagtaagaatgaaaa^ 

aagataagcttoaactc^^ 

aamgcgttgttgaattc^^ 

agcttttactttgattataagtaatgcatgcaattaatgggcaaataaa^ 
ctcaagttcgcattgaaattaaataaaccaatgcatattaaggcgtaaatctaaa^ 



aggattttcagcccaaattaaaaga 
taattaactctataatca^ 
cttoxacggtagagaaatat^^ 
accacccattattatgttcccttaata^ 
aaMccttaaacaaagagcctctataaatctattttaaaaaaacccccaa^ 

Soyabean genomic DNA encoding an EPSPS (SEQ ID No. 32) 
aaaagtctgaatgaagtgaa^^ 
) ccg<xttttaatcacaaattgtcgtgtatcgaaagtttgttgaattttatag^ 

accatgtaaatttttttaalggg 



WO 02/26995 



PCT/GB01/04131 



-44- 

aagactgaagtgaagttgttgtgmggggggttgactcttttaggaagaggamgagggaamcgccaatgggggagcagaactoc 

gggacaccctgagaatmgagactccgggaattgaagfocaacaggttt^^ 
attttttatttttattgtttatttatccattattgctatttttgtaggtcctcttggtcagggga 
gracttggctgcaagatacaacaagcctgacaatgagattgttgaccattacacgtaaaaataactaaatcatacac^ 
5 ttagttgttggtgttttgctaatggcgtttgttttatgcaggtatgctattctgggtgatggttgtcaa^^ 
gctcacttgctggccactggggactggggaagctgatagcat^ 
caccgagagtgttgatagtcgttttgagggacttgggtggcat^ 
gcxattaaggaagcaaaagctgtcaaagacaaacc^^ 

ttgtagttaccataacaatcataatctgtcatgatttatagggattctmaaactaaggttataaacatg^ 
10 attgtagtaccteamgagaaactggaagttataggtggga^ 
ttgctaactctatcatttcacaggtcacaaca^ 

ggtgccaaagaagttgatgccaa^ggcagaaccttggatggtcacatgagccattccacgtgcctgaggatgtc^ 
gatgggtttagagatttctttmcctgjgacactgctgatg^ 

gccacacccctgagggtgctgcacttgaagctgagtggaatgctaagtttgctgagtatgaaaagaaatacaaggaggaa^ 
15 aattgaaatctattatcaatggtgaattccctgctggttgggagaaagcacttccggtgagtaaattctaaactcaca^ 
ttgtaacttttcaaggmggattacatactatccaaagttgatcatgatttcagtttgatttggtagtatgggg^ 
attaaaacctggcttccaatttgatttcagacataractc^ 

gcccttgcaaaggttcttcccggtc^cttggtggcagtgcagatcttgcttcttccaacatgaccttgctc^a^ 

aaaggatactccagcagagcgtaatgttagattcggt 
20 ctggactgattccatattgtgcaaccttctttgtatte^^ 

tttatgtratgacccatgattc^taggacttggagaagatgg 

ccaaacattttgatgcttcgtcctgccgacggtaacsgaa^^ 

ccattcttgccctatccaggcaaaaactgcccca^^ 

caactccactggcaacaagcctgatgtcatmgatcggaart^^ 
25 aggaagggaaggctgttagagttgtttcccttgttte^^ 

ctgctgmcagccagagttagcattgaggcaggatcaacatttgggtgg^^ 

tgafcgtmggagrtagtgctccagctggaagaatataca^ 
atctagaacttttgattttttmgccttctggttttggttgagagcattccatgtcatgaataaga 

ttgggattctaagtgaagaagcaacatagatcaaagaatttt^^ 
30 aattcaatgmgatcatatcaaccatagatgctagm^^ 

atggccgtgtmgttcattaagaaatgttgacatggatagmgttgcatggtaaattggtgattaatacc 
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gttaaaaagtgttttgttttaatctctagttaatctaaaaatgcctacttattttatW^ 
acttgtctatataagttccttatggaaaaagga^tacaaaataatttaaaaatgctgaatttgto 

aattgttttttmcatatatataattgactttcaaaacttgtctate 

gccaafctctecctagtg^ 

acgttaggggtgtaaattcaccattcatcatttcattcattatac^ 
aaaaatctttataaaataaataa^ 

ataaaaaaatattggatatcaaacttaatgaacagaacaaaaattacaaaattgtacaatt^ 
aaaattaaggacgaaamgagaaatgataattaaaaacgattaaaaatataatt^ 
tatatatatatatatatatatatatatatatataataaaagttttaca^ 
gttggtgagagtgaaacggagaacagtgtotccctcacttg^cttrtgagte 

gtatgctatgtaaccattttgttttttaataaataaaagtgt^ 
atttctatttagttcctagttgctacaacttccagctggtgaaaaatagtcaacagagactacaa^ 

aaatatattcatttaatatgattttatttttatttaatttaaaacatgaatta 
aaaaatttctacaattatagaaaaagtttagttttgtagattgtta^ 
gttttaaagtaattatcttatatatgtacccaaaagttttaaagcaatta^ 

gaaagtataatttttaaattatt^ 

attttattccacttttaccaata^ 

aaatttgtataaactttcttat^^ 

ttttttaagaaatttgttcaaacatatttttatgacatgtcaatgcttgagagate^ 

agaagaaaaagataattaatttaattttttattaacattctaacaaattaacattata 

tattgttcttaaacaccaaaccaaccttcctaaagtcctaatctaataaaaag 

ttaccaaccttttcactcaattattttggtgtaagcaattcgacatgtgttagtgttagttg 

tccaacaacctctcacaccaccaacccccccataactcataaggccaaaaw 

atcacacacacaaaaaacctctcatttcaa^ 

ggaaaatcgaaatggcccaagtgagcagagtgcacaatcttgctcaaagcacteaaa^ 
caaatcggygaattcggtttcattg^^ 



gggaaat 

ccggagatcgtgttggaacccatcaaagacttctcgggtaccatcaca% 
ctgctctrtctgaggtaaagtttatttatttatttatttctgtatcca^ 
acttggcttctttcttagtcatccaa^ 
cgtggagtggattttgaatalttgattttttt^ 
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agtggcgtatgatttagaatgatgatagaaatgt^^ 
tttctgtataaaggagtatgctagcaat^gttaataatgtcttgttaaaagtat^ 
gtatacatatctacaggaaatgtattaggtga^ 
tggttgatatt<x:aatacattcatgatgtgttatttacgggaacgcaatattgact^ 

5 ccaaagaaatemgactcttgattat^ 
atggtgctgcaagttccgtgttactte^^ 
tttttattcttatttagggaacaactgtt^^ 

gcgtgtggaagaygacmaaacaaccaaacaagcaattgtkgaaggctgtgggggattgm^ 
aatcaatttattccttggaaatgctggtactgcratgcgtcctttgacagcagc^ 

10 mgtttgttcatcatgatctc^ 

ccagattagtcatccttctttaaatgaacctttaattatatctatgtattta^ 

tatactgattacactctgaggaatgcacctgagggtc^ 

gattactggtttgagtctctaaat^ 

aattggggamggttgctggtcttaagcarctyggtgcagatgttgattgctttcttggc 
15 agggaggacttcctggcggaaaggtatggmggamcxattagaataaggtggactaacttt^ 
gtttccccctctttagaataaga 

atattctttaamgataaattttttaaacgaaggaatatgaagatag 

cctctatttaaadacaagatatt^^ 

ttggttggagacctttccttttcaga 
20 cctgtgaaaagaaattgtttacttttctttatttgttacctacatcct^ 

ttattctctccaaagaaaatat^^ 

gttagcagtcaatacytractgcmgc^^ 

catatgttgaaatgactctgaag^ 

caaaagtacaagtaggmctatgttttagcattacatcactttttagtato^ 
25 ctggcaatgctttgttgaaggtgatgcttcaa 

ctgtggcacaagcagtttacaggtattcatgcttgacactattttctattatc^ 

tcttttetcWcccctcacttc^ 

gtgattatcctttttagtaaacttttttttt^ 

tatttcattaacagtgataaagataaaaataaattgtcaagtaaagcaagttagcatgtcaga 
30 agcttgcacagtacattagctttggtgaca^^^ 

cacttaaacacggattagagagagtgaggatgaacaataataacagacacaaagcttagttttatac^ 
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tctggctcaacaatag^ 

gatgttccctgtgcaaaacaggttcaccgacaaaagagaacatctacaagt^ 

jcatgaagttttttcctaagctc 



ccatgctgcgtaggactoattgattcta^ 
gtgaaggctgagtcttgatgataggcttgatttgcttgcg^aagaagaacgttatt 

gttac^gcgattggagatgacggacttgtggtgggaggagggtaagtgtta^ 



10 aactgtaaactmgtattattgattagctgatttggtagtacaagagag^^ 

gaactctccatacacagaaagacgtatacm 
aatctgagaattttaattgggaaactgaggaaaatagcaagggtttcagaa^ 

ctcaagggtacgcttggga^ 
tttttaaataaaacaaataaacttatagattagcataca^ 
15 gtmc^cttcagtgtctcttgtcWccttaattccttatca^ 

ctatagtatgtataactattcaaataaatagtagaa^tgtttcaaa^ 
agaagtatatcttcttgtgaagagtagtctcaagtggttaattgtcaaatt^ 

acgaacgattatttaaatagg^ 
taaattccattattttttaatttgtgcttatgttaaactttgtc^^ 
20 taaggttacatggtcagagaacagtgtcacygttwctggaccrccac^ 

atgaacaagatgccagatgttgccatgacwcttgcygttgtygcactatttg^ 
agctgtttgtttgtgttcaagagcttgttttaacaaattgttt^ 

ctaaccatgtgaaatgctt^^^ 
aagcttcgtacaacctggtctgtgggctggtcc^ 
25 acccacccagattggcctgttctttttaaataaaacttggctactta^ 
attatctgttcagttatgttcttcttcttcttattattttt^ 
ctagtttggttacatca 



Tcttaaatetctttttagtctttgcaat 
cttcaatagttttcttcttaa^^ — ~& 

30 ttaacgtttttttgcttttagtcctte 

ggtaaagttgtg 
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ccctcccccataccttcgcatagcgaagagcctctgggcaatggggtagaaaaagaagaatgctttag^ 
caaagcactatctaaagtgctttgaggactaaacacaaaacaaacataatttgcaaggactgaaaaaca 
caaaaaaacaaaaaaaaaaaacaaaattgcaaggaccaaaaatatatttaaaccaacatgaaatM 
catctaalagcttaagctmgggatagttggttaacaatatggtattagagtct^ 
5 ccagttctagttaamcttmggtccacttcaaataaaaagttgaatttaaggg^ 
ctcaaagggctmgtatggggggcatgtttggaatato^ 
acttgtattattacagtgtctaaactaga^^ 
atttgtttatttmccaatctttttaacaattgaaaggt^ 
mgctgctcagattctcctttggactatctaccatatcatttagcaaacctgttc^^ 
10 gtggcaagttggagagttaaagagactgagaggatgatagcaatctgcacagaactcagaaaggtgagtgtctt^ 

gtttcgatattactggactttttacatctcactgacctactctgtcttgg^ 
gtgattactccacctgagaaattgaatgtcacagctatagacacatatgatgaccacagaatggcca^ 
gggatgttccagtaaccatcaaggatcctg^ 
ctaagcattttgtacattgagtaggaagagagagagagat^ 

1 5 ttcccttgtaagaacttgagataatttgtattgttggaaagagctattgattctgttg^ 
attcctcatatgtggtggtgtacattgcatctttgcatttgatcccatttctattgggatag 
ttagcagataatatgtggtgaccagaaaatagtctagctagcaaatagacctgtgattgcatg^ 
taatgagtgacaatgttaaaaagctcgccaacaggagttgacacatctgattgagtcttctato^ 
cagattcctcctgttggtaggtaatatgtaaatttcactatatatttaccgaggtt^^ 

20 aacttttaMcaaaaaaagattgctaaaagctt^^^ 
gccacgtaccacttttccaccaacttgagagccrt^^ 

ttggcaattgcaagtcgWgtggaagtcacatgaaattcaccataggcttcattgttcaaatcct^ 
tgcaagaattttggctactcttataaatg^ 

cccatccatttttccaagaactaaaatatatataaattattaatatgatattacatt^ 

25 attttaagtctattcatacgaaaaaaatattt^^ 

caacagatgatttgagttaaaaaaatgtatatataaactattgcatatctgctga^ 
ccatgtgacaatgtactgaatgttttagaatacagtctggmgataataaaacgtgcata^ 
aaatgatgatatgcgaattgaaagaaagaaaaccataatgtaagtgcagttttctctrt^ 
ggaaggggtattggctcatgtgggatattagtttttgtcccttacctaaaa^ 

30 ccgctaccacctttttcgcttt^ 

ttttatgttgggaaaaaagtgccatgtaaattcta^^ 
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aagaattgaagctctccaacaagg 
tatatatatatatatatatata^^ 
ggtggtgacactalatggccaaaacaa^ 
gcaccttgtgaacgaagcaatcttgtcc 
EPSPS 10 reverse (SEQ ID No.33) 



EPSPS 4 forward (SEQ ID No.34) 

gcwggaacwgcmatgcgiccrytiacigc 

SEQ ID No. 3 1 with double mutation (SEQ ID No.35) 

aaaaaataccttttagtaaaataagagcgtmgatcaatcagacgtmctcmg 

gaaactaaggctaaaatcaa^ 

aaggatcatttgtaaggtcxaacg(xttaaaatgatcacccttea^ 

tctgatttcctctccaaaggagggtacgtaggagcaaaagccccgcttttg^ 

acacaatttccacaattctagaaataaggttgttgtctttcg 
cgaacttagaatm^ttttgaccggWccttcggttttccegacgttttccac^ 

ttggaaagcgcacoatgagcchjgc^ 

ctatggacctcttaaggtgattatcaaagtggaagtggtgacctaca^ 

glatctcagctgaaacatgctatcaatccaaatgttatcagaggaacatgagctgcatgttcaacc 

atgtcaaatggcacaagtgagattttgaatttctgatgttgtaaaaatctcag^^ 
aatcxaaactgaccxaaaattctcaaatcacatgaaagcaaaaatgcatataacacgaagaataag 

gmcgatgattaaagcgttgttgttg^ 
cgaagatagagaaaaagatatatagattttcaaggctca^ 

icacgaagaacaaaaggtg 



tgagagaaagaacaaagg 
attaaaattattttaaaaaatgatgtcattttaaattctttaaa^ 

aaattatcttatttaaatacc^ 

cagttcatggaatccttgatagataatttaaaggttgctggaaattggatgaa^ 
atttggagagtgttgtgtggttgtrtcccctgtaggcaaaagcttcgatgtaaagg 
attattgaaaatgaatgacacattttatttggttgaaatcaagaaataagcatg^ 
ggtgaatgcagcaaaagaattaatgaactccttttcgatctacttggatcactacatggagatattat^ 

agcagttggaattcttggaalgacaagatatgaaatgaacataccaaccrt 
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gcaaagtgc^ggtaatatgctc^tcaacatcaattaacaaatgttcatgacatrt^ 
ccaccgtcaagmccttaaatgcaacattaatgttgctcatttcaaggaggagaatagtm 
attcgtcaaagctcactcacgtmcgacatgggtcgacatgggttacctgacccaaag 
ttgatctgggcccagaclattggttto^ 

5 ataagattatcctatttgtgcatgaa^^ 

aaagatattggacatacaagctaaaagtgataga^ 
caaatttttmgtttatttaaaataaagtggttcattaggtagctcttgaaaaaa 
ataatattctattataagtaaagaggaataattt^ 
attttattttgttaatagttgte^^ 

10 tgttactaaatataaamcatggtatactcaggacggttataacggaaatca^ 
agaataatattaatttattctgtttaca^ 
gmcttattttattaccatmgttcacttctcaattaataaaataaaaatate 
tacagatttttctctattaactaattaattaaaaattatccttatttto 
tatgtaccaaaaagttttaaagcaattattataaaaattaataaatttatcatataaaat^ 

15 aaattatttaaaatttttgaatgat 

aaacacttataacaaaaaaagtaagtaaaaaataaaaaRaagtttttcaataaagtataaactaattt 

ttatacattgataatataaattttttacataattatccgatcaactcattata^ 

aacaatgatttgcaattagatgataatataaaattattto^ 

taactcaaaaaaaatgttacttataatm^ 
20 actactctcttagacaccaaacaaccctto 

ttcaccaactttttcactcaattgtttt^^ 

aacaaccactcacaccaccaacccccataaccaffirt^ 

tctcatttcagagagagagagagagacttcacagarc^ 

caagtgagcagagtgcacaatcttgctcaaagc^^^ 
25 attgaggccacgcctttggggggcctcaaaatct^ 

gaaattccggcgtgtttaaggWctgcatcggtcgccgccgcagagaagccgtcaacgtcgc 

agacttctcgggtaccatcacattgccagg^ 
tatttatttgmgmgltgttgggtgtgggaataggagtttgatgtgta^ 
aagc^tcatgtcccal^aaagaaatggacacgaaattaagtggcttatgatgtgaaat^^ 
30 gtagcaataagcatattcaatatctggattgatttggacgtttctgtataaaggagta^ 
ctcctaaaaatcaagtgggagtagtatacatetctacagcaaatgtattaggtgaggcattt^ 
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cagttaatgtgaaaatcaatggttgatattcc^^ 
aagattatttattttatcdctaaagtgactaattgagttgcttactttagagaagttgg 

ttgaatatacaatctgagtcttgattatccaa^ 
gaatgacatttc^gttccatgtatgtcactttcattctattatttttacaacttttagto 

5 trtgtttttttttagggaacaactgttgtagacaacttgttgtatagtgag 

gtgtggaagatgacaaaacaaccaaacaagcaattgttgaaggctgtgggggattgttt^ 
aatttattccttggaaatgrtggtactgcaatgcgtcc^gacagcagctgtggttgctgca^ 
gttcagcataat^gaattgttcctcgtataactaatcacaacagagtacgtgttc^ 
tcatcctttcttcttaaaaggaacctttaattatcaatgtatttatttaatatttaaato 

10 ctgagaaatgcacrtgagggtgttcctoatgat^ 
accatettgatgttcaaaatamcagctacgto^^ 

aagcaacttggtgcagatgttgattgctttcttggcacaaactgtccacctgttcgtgtaaatgggaa^^ 

gtatggtttggatttcatttagaataaggtggagtaactt^ 
gactaagggtaggtttaggagttgggttttggagagaaatggaagggagagcaattttm^^ 
15 ttttttaagtaaaagaatataaagatagattagcataacttaatgttttaatct^ 
aatamgtcctccattccctttccctte 

cctmcagagattatccctcacctttattatagcctttctatttttaaacttcatatag 

gttataccttcccctccttattctct^ 
gaaactgtctggatcagttagcagtcaatactt^^^ 

20 



ttggtccatggaggtcaaaagtacaagtaggmctatgttttagtgttacatcactttta^ 
tcctattaggtctcctggcaatgcttttgttgaaggtgatgcttcaagtgccagttatttactagct^ 



ttcaacccccttctmctctttcccctcactcccaaaatagagcttatcattgtatcxt^ 
25 attctacatgattatcctttttagtgaactttmgtcttttaaataattgatatca^^ 

atttcattaaga^gataaaattacattgtcaagtaaagcaagttagcgtatcagataatga^ 
cacaattcattagcttagtgacaattgtgatgtagatagccataatcttccatgcaacttato 
(^tgaattagagagttaggatgaacaatagtaacagacacaaagcttatttctagaccacccagtggtg 
gagattaatggcagcatctgattgaagcttctWggtattattggagatgaaatcttgagccaa^ 
30 ttggttcaaaattgctttcaataaaatgattacagaatcaaattcttaagaaaga^ 
aggctcacctacaaaagagtgtctacaagcacctta^ 



BNSDOCID: <WO 0226995A1_I_> 



WO 02/26995 PCT/GBO 1/04 131 

-52- 

gagattggagttgattagatttggtgtgtgtgagattctg^ 

atcctagataggattgaamgagagagaggcttgaagtcaagat^^ 

attctaccaagataacaatcttctgtggtgagaatatgagtgggaattgtcaggcaacaaagaa^ 

gtgcagagagaaaacagaaaatgaagaacttgtgtattgaaggaattgaaaagccaattacaattgt 
5 ctgaagtgtataactaactaaacagaatgctaccaacctagaggcagttgagaattgctcccto 

aggcttgamgctgccaccgaagaagaacattatt^ 

gaaggacttgcggtcgtcgaaggttaagtgtt^^ 

gaagcggatgcggcagaagccatgggttcagatcagaacM^ 

actgaactgtaaactgttgtattattg 
10 tcaatgaggtccttcaatggtgatttttcaccatggagatgcagcggaagataaatgaagaggtgagaggaggc 

aataaactatggaagaaggagcttcaccaccaagagagtgtcttggataagaagcttagagaggaagctt^ 

gagagagggggaacaaaaaattgaaggaggaaaagagggagagaagttgaactttgaagtgtgtctcacaagactcatt^ 

ttacaacaagtgttacacatgcttctatttatagtctaggtaacttccttgaga^ 

taacxtccttgagaaatttccttgagaagcttccttaagaagattcctagagaagctaa^ 
15 tcacctccttgagatgagaagctagagccttagctacacacacccttataatagctaaa^ 

aaaagttcctactacaaagattactcaaaatgtcrt^ 

agaaggaaaaacctattctaatattacaaa^ 

cctagggtcttctttagtagctctagtccaattttcttggagtcttctatccaaM^ 

gggaatalatatacagttgtaagttgtaactacctaa^ 
20 aaaaactgaaactgagatatectcattacatagaagaattacatgatgaatctgaaaattt^ 

ctttcagagttcattttgacaataggta^ 

gaaggagggaaattttttaattctagtaaate^ 

cctgatcttttamgtttcatgaagttttttatagcctttctgttt 

ttgtcctgttgtctgtgtttctgttttctttctttagtgtctcgccmcctt^ 
25 tttttttacctgtactataactttccaaacaaatagtagaa^ 

aaaaagcaagcagaggctaaagaagtatatcttattgtg^ 

tagaagtgggttgattagtttatacgaacaattatttaaatagggttgatggt^ 

accagcctcagtgcagtcattttaagttccattattttttaattgtgc^ 

gttcttgaaaagatgggagctaaggttacatggtra^ 
30 tgcgaggcattgatgtcaatetgaacaagatg^ 

agagatggtatggtttagctgtttgttWgctcaagagcttgttttca^ 
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25 



ttaatgaaattatta^ 

mcttttatctatgtagtgtggggcatctat^^ 

catgactgtcttttgcagcaaactggttaaacttgggtgacctgttgacccacc^ 

ctacttataagatttcaax:atgatacttgaatgaccacccttactttaaaggt 
ttattcttaltatttttt^^ 

tacctttttmc^taaattattttttcccataatatatttatatg^ 
aatactatttcaatgact^ 

atggaaggcactagagtgcagtgcatattgcacaaggttctttaatggcatcttgaatttgttagaacatgg 
cctatgtcttcatct^ 

ggcaggtggacctgcgcattattcacgtgccaggctcaaagggattttgtatgaggggg^ 

tgtggttcacctatcatgtgtca^ 

acctaaggaac^gaatgcttcttgttctgagatctttatttttm 

agaattagggggtcagcttttctgttgtaagcgtgaattaagtgcattto 

gcaaacttettgttgctcag^ 

aatctgcacagaactcagaaaggtc^^ 

ttcagctaggagcaacagttga^ 

tgpccacagpatggpc^^ 

tcctgactactttgaagtccttgagaggttaacaaagcactaa^ 

oacaaagcagtgagtatatggcttgcttttgmctaaacattccttta^ 

gttgtatcaaagtaatttaatttl^gtaccgacttggctctactcatatg^ 

gatgggaccatacattttttgtttattatttaagtcaaacatatc 

gattgcatgtttgagttcaggacatttattaatgagtgacagaattttaaa^ 

taaacactaactgtaacttgagtttgataatcagattccccccgttggtag^ 

tcttgacttcgaagtggtgagtgtaat^^ 

agcaatgaaagcatgaccatggccttggggggccaaatggccatgtaccac^ 

ccttttgtttcattttttttacaataaagaaagto 

cttcattgttcaaatcctttagtca^ 

aatataaattattgcatatctagag^ 

tggttatttgtcctttaaaMgcgat^ 

ctgagagatacttgagttaaaaaaatgtat^^ 

atgttacaatgtactgaatgttttacaatacagtctggWgataataa^ 
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atgatgatatgcgaattgaaagaaagaaaac^^ 

ggaagggtactggctcatgtgggatattaatttttetc^ 

accacctttttcgcttt^gttttgtte^^ 

tgagaaaaaaaatgtgccatgtaaattccagagattgaaattttagcttcaaattaatatttatmattatc^ 
5 agaamgaagttcttcaacaaggaaataaacacttcaatgctcaatccltagagtcagtcactagatcccaaa^ 
acattagtatatatatatatataaacacagcttatattatattaatgagtgatgcgcgaggacggtggtgacac^^ 

aaatgtaatttaatttttttttct^ 
tcctgtgattgttctacactagtccttacgtgtacc^ 
gatgctmgccttcattttcaagmgtttacac^^ 
10 gcacacaaaaactagtagacaataattagtactacttgggttccattaa^ 
atatacatatgttttatgagttte^ 
ataatttcatgctttgatmgattgtacgtttt^ 
ccgttcaaaagagagacgacaaggataatactaatactagtaga 

ggtaagaaaaggggaggagaaaagtaagaatgaaaacatataacaaattgtgattgaacttttccatgacgtaatgaaaacat^ 
15 aagataagcttcaactcctaagaagagatatgtcaaagaaacttaggattttttctctatttataacg 
aatttgcgttgttgaattcgttgtaaagggaaaatcaag 

agcttttactttgattataagtaatgcatgcaattaatgggcaaataaattatttgaaaccaactaaagtacaa^^ 
ctcaagttcgcattgaaattaaataaaccaatgcatattaagg^ 

aaggtactttgaatttagttgttattgaccgtta^ 
20 aggattttcagcccaaattaaaagaagttttacggWgattctctacaaac^aattgagatttcta^ 
taattaactctataatcaaaccttttaacccaatcttmctcaattctctgaattaagacaato 
cttcccacggtagagaaatatagggagtcccttaatagcaaaag(xtcaaaattcagaaaatctcatccattgcattt^ 

accacccattattatgttcccttaataaatctcacacttaaaaaccctcctcacccataato^ 
aatatccttaaacaaagagcctctataaatctattttaaaaaaacccccaaatttaagga 

25 
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SEQ ID No. 32 with double mutation (SEQ ID No. 36) 

aaaagtctgaatgaagtgaagttgttgtgttagggggatgctgaatttgatttto 

ccgccttttaatcacaaattgtcgtgtatcgaaagtttgttgaattttatagto 

accatgtaaatttttttaatgggtttgtgtttgttttaac^ 

aagartgaagtgaagttgttgtgtttggggggttgactcttttaggaagaggatttgag 

gggacaccctgagaatmgagactccgggaattgaagttacaacaggtttgatatgagto^ 

attttttatttttattgtttatttatccattattgcta^ 

gcacttggctgcaagatacaac^^ 

ttagttgttggtgttttgctaatggcgtttgttt^ 



caccgagagtgttgatagtcgttttgagggacttgggtggcatgttatttgggtaaag^ 

gccattaaggaagcaaaagctgt^^ 

ttgtagttaccataacaatcataatctgtcatgatttatagggattcttttaaacto 

attgtagtacctcatttgagaaactggaagttataggtgggaaatcaaaatagtaaattatgggtca 



b „ D --- 0 - DC= - ^aaaaagtatggat 

gatgggtttagagamcttttttcctgtgacactgctgatgtttataatga^^ 

gccac^ccctgagggtgctgcac^^ 

aattgaaatctattatcaatggtgaatt^ 

ttgtaacttttcaaggmggattawtactatcwaagttgatcatga^ 

artaaaacctggcttccaatttga^ 

jctcaaaatgttcggggacttcca 



aaaggat 

ctggactgattccatattgtgcaaccttc^^ 

25 tttatgtcatgaccc^ga^^^ 

ccaaacattttgatgcttcgtcctgccgacggtaacsgaaacagcxsgagcata^ 

ccattcttgccctatccaggcaaaaac^ 

caactccactggcaacaagcctgatgtcattttgatcggaactggttcggaattggaa^^ 

aggaagggaaggrtgttagagttgmcccttgtttcttggggaacttW 

^ xx * — + ++ „„ rrtrrrtriQrTC ,^-,attD+tapa3caaaeeeaa 

30 



ctgctgtttcagccagagttagcattgaggcaggatcaacatttgggtgggagaaaattgttgg^ 
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atctagaactmgatttttttttgcctt^ 

ttgggattctaagtgaagaagcaacalagatcaaagaatt^ 

aattcaatgtttgatcatatcaaccatagatgcto^ 

atggccgtgtmgttcattaagaaatgttgacatggatag^ . 

gttaaaaagtgtmgttttaatctctegttaatctaaaaatgcctacttatto 

acttgtctatataagttccttatggaaaaaggacctacaaaataatttaaaaatgctgaatttgttaggaagW 

aattgttttttWcatatatataattgacmcaaaacttgtctatatattataaattc^ 
gccaatctcfcttctagtgaactgttattct^^ 

acgttaggggtgtaaattcaccattcatcatttcattcattatacacaatagtctttcataatccaatgaaa^ 

aaaaatctttataaaataaataaaaacatttttttgttataaatttcagttgaaatacataaccato 

ataaaaaaatattggatatcaaacttaatgaacagaacaaaaattacaaaattgtacaatttaaatgataaaaaacc 

aaaattaaggacgaaatttgagaaatgataattaaaaacgattaaaaatataattaaactaaaatto 

tatatatatatatatatatatatatatatatataataaaagttttacattataattttgaaatgaagtat^ 

gttggtgagagtgaaacggagaacagtgtctccctcactt^^ 

gtatgctatgtaaccatmgttttttaataaataaaagtgttttgttctgctatgcactccccat^ ■ 

amctatttagttcctagttgctacaacttccagctggtgaaa^ 

aaatatattcatttaatatgattttatttttatttaatttaaaacatgaattatttgm^ 

aaaaatttctacaattatagaaaaagtttagttttgtag^ 

gttttaaagtaattatcttataMgtacccaaaagttttaaagcaatlattataaaaaag 

gaaagtataatttttaaattattaatacgtta^ 

attttattccactmaccaataaatttcctcataaacacttataagaaaaaaagtaagtaa^ 

aaamgtataaacmcttatattagttWctgaaaactaattttaaattatacataaatt^ 

ttttttaagaaatttgttcaaacatatttttatgacatgtcaatgcttgagagatataaggtgagto 

agaagaaaaagataattaatttaattttttattaacattcta^ 

tattgttcttaaacaccaaaccaaccttrc 

ttaccaacctmcactcaattattttggtgtaagcaatt^^ 

tccaacaacctctcacaccaccaacccccccataactca^ 

atcacacacacaaaaaacctctcamcaaagagagagagagatttcacagaccaaattgcagaggacgacaaagttca^ 

ggaaaatcgaaatggcccaagtgagcagagtgcacaatct^ 

caaatcggygaattcggmcattgaggccacgccmggggkscctcaaaatrtcgwtcyygrt 

gggaaattttaatgyggggaagggaaattccggcrtgtttaaggtttctgcatcsgtcgccgccgccgcaga 
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ccggagatcgtgttggaacccatcaaagacttctcgggtaccatcaca^ 
ctgctctctctgaggtaaagtttatttatttatttatttc^ 
acttggcttctttcttagtcatccaac^cttatatctccaaaat^ 

cgtggagtggattttgaatalttgatttttt^ 
agtggcgtatgatttagaatgatgatagaaatgtgtataggtg^ 
mctgtataaaggagtatgctagc^tgtgttaataatgtcttgt^ 
gtatacatatctacaggaaatgtattaggtgaggcatttggcttctc^ 

tggttgatattccaal^ 

ccaaagaaatctttgactcttgattatccaagtatggttgtato^^ 

Ltttttataacttttaattatttaatagatgtcttgtttggcataaa 

cattaaggacccttggact 

gcgtgtggaagaygacmaaacaaccaaac^ 
aatcaatttattccttggaaatgctggtactgcratgcgtcc^^ 

tttgtttgttcatcatgatctctg^ 
15 ccagattagtcatccttctttaaatgaacctttaattatatctatg^^ 
tatactgattacactctgaggaatgcacctgagggtcttaaW 

gattactggtttgagte^^ 

aattggggamggttgctggtcttaagcarctyggtgcagatgttga^ 
agg^aggacttcctggc^^ 

20 gtttccccctct^^ 

atattctttaatttgataaatttttt^ 
cctctatttaaaaacaagatattmcctcxattccctttccctt^ 

ttggttggagacctttccttttcagaggtt^^ 

cctgtgaaaagaaattgtttactmc^ 

25 ttattotctccaaagaaaat^ 

gttagcagteaatacy^^ 
catatgttgaaatgactctgaagttgatggagcgttttggagttt^ 

caaaagtacaagtaggmctatgttttagcattacato 
ctggcaatgcttttgttgaaggtgalgcttcaagtgccagttay^ 

30 ctgtggcacaagcagtt^^ 
tcttttctctttcccctcacttcca^ 
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gtgattatcctttttagtaaacttttttmgt^ 
tamcattaacagtgataaagataaaaataaat^ 

agcttgcacagtacattagctttggtgacaattgtgatgtagatagccataacctcccatgcaa^ 
cacttaaacacggattagagagagtgaggatgaacaataataacagacacaaagcttagttttate^ 
5 gaccaagcagtatgagtgagatcaatggcagcatctgattgaagcttcttgtggtattW 
tctggctcaacaatagccattggttca^ 

gatgttccctgtgcaaaacaggttcaccgacaaaagagaacatctacaagttcctttacaca^ 

ccaagagtcatgttgcgaacttcaggttggagttgacgaaamggtgcatgtatgattctggaaa^ 

ataaaagtgaatgcagccaagaaccctagataggattgaatcggagagaggcttgaagccaagataacagcaact^^^ 

10 ccatgctgcgtaggactcattgattc^ 

gtgaaggctgagtcttgatgataggcttgamgcttgcgctgaagaagaacgttattgtt^ 

gttacaggtgcgattggagatgacggacttgtggtgggaggaggg^^ 

gccaaagaagctggttgcggtggaggaggtggtggtgcagtggaagccatgggtmgatgc^ 

aactgtaaacttttgtattattgattagctgatttggtagtacaagagaggaatatata^ 
1 5 gaactctccatacacagaaagacgtatacaaaggataacagacaaacagaaacttagatatactca 

aatctgagaattttaattgggaaactgaggaaaal^ 

ctcaagggtacgcttgggagtgggggtttttagagggaaaagggaaggagggcaaacttt^ 

tttttaaataaaacaaataaact^ 
gtmctttcttcagtgtctcttgtctttccttaattccttatcaatacctt 

20 ctatagtatgtataactattcaaat^ 

agaagtatatcttcttgtgaagagtagtctcaagtggttaattgtcaaattgtatcat^ 

acgaacgattatttaaatagggttgatgatgtggt^ 
ta^ttccattattttttaatttgtgcttatgtta^ 

taaggttacatggtcagagaacagtgtcacygttwctggaccrccacragatty^ 
25 atgaacaagatgccagatgttgccatgacwcttgcy^ 
agctgtttgtttgtgttcaagagcttgttt^ 
ctaaccatgtgaaatgcttgaatttttag^ 

aagcttcgtacaacctggtctgtgggctggtccagtccttcttactcatgacagtc^ 
acccacccagattggcctgttctttttaaataaaacttggctacttataag 
30 attatctgttcagttatgttcttcttcttcttattatttttaa 

ctagtttggttacatcatataatacctttttmcatatattattmcccate 
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gmcgttttmcctggcttgaatgctamcaatgactgggccttattatW^ 

cttcaatagttttcttcttaatcatggaaggcactagagtgcatat^ 

ttaacgtttttttgctttto^^ 

ggtaaagttgtgccttggtgacttgttggtcatgggttcgaatccggaaacagcctcm^ 

5 ccctccrccatocctt^ 

caaagcactatctaaagtgctttgaggactaaacacaaaacaaacataamgcaaggactgaaa^ 
caaaaaaacaaaaaaaaaaaacaaaattgcaaggaccaaaaatatatttaaaccaacatgaaato^ 

catctaatagcttaagcttttgggata^^ 
ccagttctagttaatttcttmggtccacttcaaataaaaagttgaattta^ 
10 ctcaaagggctmgtatggggggcatgmggaa^ 

acttgtattattacagtgtctaaactagacatgatctttttc^ggaaggaaam 
amgtttattttttccaatcttt^^ 

tttgctgctcagattctccmggacMctaccatatcatttagcaaacctgttc^ 

gtggcaagttgg^agttaaagagactgagaggatgatagcaatctgcacagaactcag 
15 gtttcgatattactggactttttacatctcactgacctactct^ , 

gtgattactc^c^gagaaattgaatgtcacagctatagacacatatgatga^^ 

gggatgttrcagtaacratcaaggatcctg 

ctaagcatmgtacattgagtaggaagagagagag 

ttcccttgtaagaacttgagataatttg^ 
20 attcctcatatgtggtggtgtacattg^ 

ttagcagataatatgtggtgaccagaaaatagte 

taatgagtgacaatgttaaaaagctcgccaacaggagttgacacatctgattgagtcttctatt^ 

cagattatcctgttggtaggtaato^^ 

aactittatatcaaaaaaagattgctaaaagctt^^ 
25 gccacgtaccacttttccacc^cttgag 

ttggcaattgcaagtcgmgtggaagtcacatgaaattcaccataggcttcattgttc^ 

tgcaagaattttggctactcttataaatgctgtgcggcaaagactacggaaaaat^ 

cccatccattttt(xaagaactaaaatatatataaattattaatatgatatta^ 

attttaagtctattcatocgaaaaaaatot^ 
30 caacagatgatttgagttaaaaaaatgto^^ 

ccatgtgacaatgtactgaatgttttagaatacagtctggtttgataataaaacgtgc 
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aaatgatgatatgcgaattgaaagaaagaaaaccataat^^ 
ggaaggggtattggctcatgtgggatattagttttt^ 
ccgctaccaccttWcgcttttaaattmgttctgttttcac^ 
ttttatgttgggaaaaaagtgccatgtaaattctagagattgaaattttagcatcaaato 
5 aagaatttgaagctctccaacaaggaaataacacttcaatccttagagtcatcctctagato^^ 

tatatatatatatatatatatata^^ 

ggtggtgacactatatggccaaaacaagtgtaatataattattttl^ 
gcaccttgtgaacgaagcaatcttgtcc 

FMVS35S90 enhancer (SEQ ID No.37) 
10 aagcttcccgggccggccgcagctggcttgtggggaccagacaaaaaaggaatgg^gcagaattgto 
atcWgcctttattgcaaagataaagcagattcctctagtacaagtggggaacaaaataacgt^ 
tcactaatgcgtatgacgaacgcagtgacgaccacaaaactcgagactmcaacaaagggtaatatccgg 

ttgcccagctatctgtcacmattgtg^^ 

cgttgaagatgcctctgccgacagtggtcccaaagatggacccccacccacgaggagcatcgtg 
15 cgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatgacgcacaatrc^ 

FMV35S46 enhancer (SEQ ID No38) 
aagcttcxcgggccggccgcagctggcttgtggggacca^^^ 

atcmgcctttattgcaaagataaagcagattcctctagtacaagtggggaacaaaataacgtggaa^ 
tcactaatgcgtatgacgaacgcagtgacgacca^ 
20 aagatacagtctcagaagaccagagggctattgaga 
atctgtcacttcatcgaaaggacagtaga^ 

tgcctctaccgacagtggtcccaaagatggacccccacccacgaggaacatcgtggaaaaagaagacgttccaaccacgtcttca^ 
gcaagtggattgatgttaattaa 
GmTerm Bam (SEQ ID No.39) 
25 ctgcagggatcctggttgcaccaggaag 
GmTermEco (SEQ ID No.40) 
gaattcccgggtatggtgaatttcatgtg 
BnMUT2PST (SEQ ID No.41) 
gacttctcgggtaccatcacattgc 

30 BnMUTkpn (SEQ ID No.42) 

cacctgcagcaaccacagctgctgtcaaagaacgcattgcaataccagcatttcc 
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GmRubPac (SEQ ID No.43) 
aatttaattaattcataaatatgttac 
GmRubbot (SEQ ID No.44) 

ggtcttctccttcttagttrtcaactacttgttcttctgrtgccagatttcaattttccttaaagtt 

5 GmRubtop (SEQ ID No.45) 

atdggcagcagaagaacaagtagttgagaactaagaaggagaagacc^ 

GmRubKpn (SEQ ID No.46) 
ggcaatgtgatggatcccgagaag 

ZMPPDK1 (SEQ ID No. 47) 
10 gcgccatggcggcatcggtttccaggg 

ZMPPDK2 (SEQ ID No. 48) 
gcgggtacctcagacaagcacctgagctgcagc 

15 ECPPS 1 (SEQ ID No. 49) 
gcggcatgcccaacaatggctcgtc 

ECPS2 (SEQ ID No. 50) 

gcggatccttatttcttcagttcagcca 
20 EPSPS motif (SEQ ID No. 5 1) 

ALLMZAPLA, wherein Z is not S or T 

EPSPS motif (SEQ ED No. 52) 

EE6IZDKL, wherein Z is not V 

EPSPS motif (SEQ ED No. 53) 
25 FG(V/I)(K/S)ZEH, wherein Z is not V 

EPSPS motif (SEQ ED No. 54) 

AL(K/R)ZLGL, wherein Z is not R 
EPSPS motif (SEQ ED No. 55) 

GLXVEZ 1 DXZ 2 XXXA(I/V)V, wherein Z, is not T and/or is not E 
30 EPSPS motif (SEQ ED No. 56) 

ITPPZ 1 K(l/V)(K/N)Z 2i wherein Z x is not K and/or Zz is not T 
EPSPS motif (SEQ ED No. 57) 
TIZ(D/N)PGCT, wherein Z is not N or L 
EPSPS motif (SEQ ED No. 58) 



BNSDOCID; <WO 022699SA1_I_> 



WO 02/26995 



PCT/GB01/04131 



-62- 

(D/N)YFXVLXZXX(K/R)H, wherein Z is not R 
EPSPS motif (SEQ ID No. 59) 
(IAOVEGCGG(I/L/Q)FP(V/A/T)(S/G) 
EPSPS motif (SEQ ID No. 60) 
5 (IAOWGCGG(I^/Q)FP(V/AyT)E 
EPSPS motif (SEQ ID No. 61) 
(W)VVGCGG(I/L/Q)FP(V/A/T)(S/G) 

EPSPS motif (SEQ ID No. 62) 
(I/V)WGCGGKFP(V/A/T)(S/G) 

10 EPSPS motif (SEQ ID No. 63) 

(I/V)VEGCGGKFP(V/A/T)E 

EPSPS motif (SEQ ID No. 64) 

(W)VEGCGG(L1./Q)FP(V/A^E; 

EPSPS motif (SEQ ID No. 65) 
15 (W)VEGCGGKFP(V/A/T)(S/G). 
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CLAIMS 



1 . A glyphosate resistant EPSPS enzyme wherein in comparison with the wild type 
EPSPS the protein sequence is modified in that a first position is mutated so that the 

5 residue at this position is lie rather than Thr and a second position is mutated so that 

the residue at this position is Ser rather than Pro, the mutations being introduced into 
EPSPS sequences which comprise the conserved region GNAGTAMRPL in the wild 
type enzyme such that the modified sequence reads GNAGIAMR3L, wherein the 
enzyme does not comprise the sequence K(A/V)AKRAVWGCGGKFPVE, 

10 characterised in that the enzyme is stable and possesses at least one of the following 

sequence motifs (i) to (viii) in which X is any amino acid and Z, Z x and Z2 are any 
amino acid other than those specified: - 

(i) ALLMZAPLA, wherein Z is not S or T; 

(ii) EIEIZDKL, wherein Z is not V; 

15 (iii) FG(V/I)(K/S)ZEH, wherein Z is not V; 

(iv) AL(K/R)ZLGL, wherein Z is not R; 

(v) GLXVEZ^XZjXXXACIAOV, wherein Z, is not T and/or 7^ is not E; 

(vi) ITPPZ^CLAOCK/hOZ^ wherein Z x is not K and/or 7^ is not T; 

(vii) TIZ(D/N)PGCT, wherein Z is not N or L; 

20 (viii) (D/N) YFXVLXZXX(BC/R)H, wherein Z is not R. 

2. An enzyme according tp claim 1 , which comprises at least two of the motifs (i), (ii) , 

(iii) , (v) and(vi). 

25 3 . An enzyme according to either of claims 1 or 2, wherein in motif (i) Z is A; in motif 
(ii) Z is I; in motif (iii) Z is A; in motif (iv) Z is K, T or A; in motif (v) Z x is R or A or 
less preferably D or E and Z^ is preferably V or A or less preferably T; in sequence 
motif (vi)^ is E or A and Z2 is P,Ior V; in motif (vii) Z is R or K; and in motif (viii) 
Z is T or S or less preferably Q. 

30 
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4. An enzyme according to claim 1, which comprises one or more sequences selected 
from the group consiting of: 

(i) (IAOVEGCGG(LT,/Q)FP(V/A^)(S/G) 

(ii) (IAOWGCGG(I/L/Q)FP(V/A/T)E; 

5 (iii) (IAOVVGCGGCI/L/^FPCV/AAOCS/G); 

(iv) (W)WGCGGKFP(V/AAO(S/G); 

(v) (IAOVEGCGGKFP(V/A/T)E; 

(vi) (IAOVEGCGGO/L/CDFPCV/AA^E; 

(vii) (IAOVEGCGGKFP(V/A/T)(S/G). 

10 

5. An isolated polynucleotide comprising a region which encodes an EPSPS according 
to any one of claims 1 to 4. 

6. A polynucleotide according to claim 5, having a sequence depicted in SEQ ID Nos. 4 
15 or 5. 

7. A polynucleotide encoding an EPSPS, which polynucleotide is obtainable by 
screening plant genomic DNA libraries with a polynucleotide constituting an intron 
located within SEQ ID Nos. 4 or 5. 

20 

8. A polynucleotide according to any one of claims 5-7, comprising the following 
components in the 5 f to 3 1 direction of transcription:- 

(i) At least one transcriptional enhancer being that enhancing region which is 
upstream from the transcriptional start of the sequence from which the enhancer is 

25 obtained and which enhancer per se. does not function as a promoter either in the 

sequence in which it is endogenously comprised or when present heterologously as 
part of a construct; 

(ii) The promoter from the soybean EPSPS gene; 

(iii) The soybean genomic sequence which encodes the soybean EPSPS 
30 chloroplast transit peptide; 

(iv) The genomic sequence which encodes the soybean EPSPS; 
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(v) A transcriptional terminator; 

herein the soybean EPSPS coding sequence is modified in comparison with the 
wild type sequence in! that a first position is mutated so that the residue at this 
position is lie rather than Thr and a second position is mutated so that the residue at 
5 this position is Ser rather than Pro, the mutations being introduced into EPSPS 

sequences which comprise the conserved region GNAGTAMRPLTAAV in the wild 
type enzyme such tha;t modified sequence reads GNAGIAMRSLTAAV. 

9. A polynucleotide according to claim 1, comprising the following components in the 
10 5' to 3 f direction of transcription:- 

(i) At least one transcriptional enhancer being that enhancing region which is 
upstream from the transcriptional start of the sequence from which the enhancer is 
obtained and which ehhancer per se does not function as a promoter either in the 
sequence in which it is endogenously comprised or when present heterologously as 

15 part of a construct; 

(ii) The promoter |from the Brasicca nqpus EPSPS gene; 

(iii) The Brasicca foapus genomic sequence which encodes the Brasicca nqpus 
EPSPS chloroplast transit peptide; 

(iv) The genomic sequence which encodes the Brasicca napus EPSPS; 
20 (v) A transcriptional terminator; 

wherein the Brasicca\napus EPSPS coding sequence is modified in comparison with 
the wild type sequence, in that a first position is mutated so that the residue at this 
position is lie rather than Thr and a second position is mutated so that the residue at 
this position is Ser raiher than Pro, the mutations being introduced into EPSPS 
25 sequences which corrinrise the conserved region GNAGTAMRPLTAAV in the wild 

type enzyme such thdt modified sequence reads GN AGIAMRSLT AAV . 

10. A polynucleotide according to any one of claims 5-9, further comprising a sequence 
which encodes a chloroplast transit peptide/phosphoenolpyruvate synthase 
30 (CTP/PPS) or a chloroplast transit peptide/pyruvate orthophosphate di-kinase 
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(CTP/PPDK), which sequence is under expression control of a plant operable 
promoter. 

11. A polynucleotide according to the preceding claim, wherein the CTP and PPDK 
5 encoding sequences are non-heterologous with respect to each other. 

12. A polynucleotide according to any one of claims 5-11, wherein the said enhancing 
region constitutes a sequence, the 3' end of which is at least 40 nucleotides upstream 
of the closest transcriptional start of the sequence from which the enhancer is 

10 obtained. 

13. A polynucleotide according to claim 12, wherein the enhancing region constitutes a 
region the 3' end of which is at least 60 nucleotides upstream of the said closest start. 

15 14. A polynucleotide according to claim 12, wherein the said enhancing region 

constitutes a sequence the 3' end of which is at least 10 nucleotides upstream from the 
first nucleotide of the TATA consensus of the sequence from which the enhancer is 
obtained. 

20 15. A polynucleotide according to any one of claims 5-14, comprising first and second 
transcriptional enhancers which are tandemly present in the polynucleotide. 

16. A polynucleotide according to any one of claims 5-15, wherein the 3' end of the 
enhancer, or first enhancer, is between about 100 to about 1000 nucleotides upstream 

25 of the codon corresponding to the translational start of the EPSPS transit peptide, or 

the first nucleotide of an intron in the 5 f untranslated region. 

17. A polynucleotide according to any one of claims 5-16, wherein the 3' end of the 
enhancer, or first enhancer, is between about 150 to about 1000 nucleotides upstream 

30 of the codon corresponding to the translational start of the EPSPS transit peptide, or 

the first nucleotide of an intron in the 5 1 untranslated region. 
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A polynucleotide according to any one of claims 5-17, wherein the 3' end of the 
enhancer, or first enhancer, is between about 300 to about 950 nucleotides upstream 
of the codon corresponding to the translational start of the EPSPS transit peptide, or 
the first nucleotide of an intron in the 5' untranslated region. 

A polynucleotide according to any one of claims 5-18, wherein the 3' end of the 
enhancer, or first enhancer, is between about 770 and about 790 nucleotides upstream 
of the codon corresponding to the translational start of the EPSPS transit peptide, or 
the first nucleotide of an intron in the 5' untranslated region. 

A polynucleotide according to any one of claims 5-19, wherein the 3' end of the 
enhancer, or first enhancer, is between about 300 and about 380 nucleotides upstream 
of the codon corresponding to the translational start of the EPSPS transit peptide, or 
the first nucleotide of an intron in the 5* untranslated region. 

A polynucleotide according to any one of claims 5-18 and 20, wherein the 3' end of 
the enhancer, or first enhancer, is between about 320 and about 350 nucleotides 
upstream of the codon corresponding to the translational start of the EPSPS transit 
peptide, or the first nucleotide of an intron in the 5* untranslated region. 

A polynucleotide according to any one of claims 5-21, wherein the region upstream 
of the promoter from the EPSPS gene comprises at least one enhancer derived from a 
sequence which is upstream from the transcriptional start of either the CaMV35S or 
FMV35S promoters. 

A polynucleotide according to any one of claims 5 -21, wherein the region upstream 
of the promoter from the EPSPS coding sequence comprises at least one enhancer 
derived from a sequence which is upstream from the transcriptional start of a 
promoter selected from the group consisting of those of the actin, rolDFd, S-adenosyl 
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homocysteinase, historie, tubulin, polyubiquitin and plastocyasnin genes and the 
CaMV35S andFMV35S genes. 

24. A polynucleotide according to claim 23, comprising in the 5 ! to 3 f direction a first 
5 enhancer comprising a transcriptional enhancing region derived from a sequence 

which is upstream from the transcriptional start of either the CaMV35S or FMV35S 
promoters, and a second enhancer comprising a transcriptional enhancing region 
derived from a sequence which is upstream from the transcriptional start of an actin 
gene. 

10 

25. A polynucleotide according to claim 23, comprising in the 5* to 3' direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence 
which is upstream from the transcriptional start of either the CaMV35S or FMV35S 
promoters, and a second enhancer comprising a transcriptional enhancing region 

15 derived from a sequence which is upstream from the transcriptional start of the 

rolDfd gene. 

A polynucleotide according to claim 23, comprising in the 5 f to 3 f direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence 
which is upstream from the transcriptional start of either the CaMV35S or FMV35S 
promoters, and a second enhancer comprising a transcriptional enhancing region 
derived from a sequence which is upstream from the transcriptional start of a histone 
gene. 

A polynucleotide according to claim 23, comprising in the 5 1 to 3' direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence 
which is upstream from the transcriptional start of either the CaMV35S or FMV35S 
promoters, and a second enhancer comprising a transcriptional enhancing region 
derived from a sequence which is upstream from the transcriptional start of a tubulin 
gene. 
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28. A polynucleotide according to claim 23, comprising in the 5' to 3 1 direction a first 
enhancer comprising a transcriptional enhancing region derived from a sequence 
which is upstream from the transcriptional start of the FMV35S promoter and a 
second enhancer comprising a transcriptional enhancing region derived from a 

5 sequence which is upstream from the transcriptional start of the CaMV35S promoter. 

29. A polynucleotide according to any one of claims 5-28, wherein the nucleotides 5' of 
the codon which constitutes the translational start of the soybean EPSPS chloroplast 
transit peptide are Kozack preferred. 

10 

30. A polynucleotide according to any one of claims 8 to 29, wherein 5 1 of the genomic 
sequence which encodes the EPSPS CTP there is located a 5 f untranslated leader 
sequence derived from a relatively highly expressed gene 

15 31. A polynucleotide according to the preceding claim, wherein the said highly expressed 
gene is selected from the group consisting of those which encode glucanase, chalcone 
synthase and Rubisco. 

32. A polynucleotide according to. any one of claims 8-31, which comprises a virally . 
20 derived translational enhancer or non-viral translational enhancer located within the 

non translated region 5 1 of the genomic sequence which encodes the EPSPS 
chloroplast transit peptide. 

33. A polynucleotide according to any one of claims 5-32, further comprising regions 

25 encoding proteins capable of conferring upon plant material containing it at least one 

of the following agronomically desirable traits: resistance to insects, fungi, viruses, 

# 

bacteria, nematodes, stress, dessication, and herbicides. 

34. A polynucleotide according to claim 33, wherein the herbicide is other than 
30 glyphosate. 
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35. A polynucleotide according to the preceding claim wherein the herbicide is a 
protoporphyrinogen oxidase (PPGO) inhibitor, in particular butefenicil, and the 
corresponding resistance protein is a PPGO or inhibitor resistant variant thereof. 

5 36. A polynucleotide according to either of claims 33 to 35, wherein the insect resistance 
conferring regions encode crystal toxins derived from Bt, including secreted Bt 
toxins; protease inhibitors, lectins, Xenhorabdus/Photorhabdus toxins; the fungus 
resistance conferring regions are selected from the group consisting of those encoding 
known AFPs, defensins, chitinases, glucanases, Avr-Cf9; the bacterial resistance 

[0 conferring regions are selected from the group consisting of those encoding cecropins 

and techyplesin and analogues thereof; the virus resistance regions are selected from 
the group consisting of genes encoding virus coat proteins, movement proteins, viral 
replicases, and antisense and ribozyme sequences which are known to provide for 
virus resistance; the stress, salt, and drought resistance conferring regions are selected 

15 from those that encode Glutathione-S-transferase and peroxidase, the sequence which 

constitutes the known CBF1 regulatory sequence and genes which are known to 
provide for accumulation of trehalose. 

37. A polynucleotide according to claim 36, wherein the insect resistance conferring 
20 regions are selected from the group consisting of those that encode cryLAc, crylAb, 

ciy3A, crylBC, Vipl A, ViplB, Vip3A, Vip3B, cystein protease inhibitor, and 
snowdrop lectin. 

38. A polynucleotide according to any preceding claim, which is modified in that mRNA 
25 instability motifs and/or unwanted splice regions are removed, or crop preferred 

codons are used so that expression of the thus modified polynucleotide in a plant 
yields substantially similar protein having a substantially similar activity/function to 
that obtained by expression of the protein encoding regions of the unmodified 
polynucleotide in the organism in which they are endogenous. 
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39. A polynucleotide according to the preceding claim, wherein the degree of identity 
between the modified polynucleotide and a polynucleotide endogenously contained 
within the said plant and encoding substantially the same protein is such as to prevent 
co-suppression between the modified and endogenous sequences. 

40. A polynucleotide according to the preceding claim, wherein the said degree is less 
than about 70%. 

41 . A vector comprising the polynucleotide of any preceding claim. 

42. Plant material which has been transformed with the polynucleotide of any one of 
claims 5-40, or the vector of claim 41. 



43 . Plant material which has been transformed with the polynucleotide of any one of 

15 claims 5-40 or the vector of claim 41 , and which has been, or is, further transformed 

with a polynucleotide comprising regions encoding proteins capable of conferring 
upon plant material containing it at least one of the following agronomically desirable 
traits: resistance to insects, fungi, viruses, bacteria, nematodes, stress, desiccation, 
and herbicides. 

20 

44. Morphologically normal, fertile whole plants which have been regenerated from the 
material according to either of claims 42 or 43, their progeny seeds and parts. 

45. Morphologically normal fertile whole plants which result from the crossing of pints 
25 which have been regenerated from material which has been transformed with the 

polynucleotide of any one of claims 5-40 and plants which result from regeneration 
of material transformed with a polyncleotide which comprises a sequence which 
encodes a protein capable of providing for elevated levels of phosphoenolpyruvate in 
the chloroplast 

30 
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Plants according to the preceding claim, wherein the said capable protein is a fusion 
of a CTP and PPS or a fusion of a CTP and a PPDK. 

Morphologically normal fertile whole plants which comprise the polynucleotide of 
any one of claims 5-40 and which result from the crossing of plants which have been 
regenerated from material transformed with the polynucleotide of any one of claims 5 
- 40 or the vector of claim 41, and plants which have been transformed with a 
polynucleotide comprising regions encoding a CTP/PPS or CTPPPDK, and/or 
regions encoding proteins capable of conferring upon plant material containing it at 
least one of the following agronomically desirable traits: resistance to insects, fungi, 
viruses, bacteria, nematodes, stress, desiccation, and herbicides, the progeny of the 
resultant plants, their seeds and parts. 

Plants according to any one of claims 44 to 47, selected from the group consisting of 
field crops, fruits and vegetables such as canola, sunflower, tobacco, sugar beet, 
cotton, maize, wheat, barley, rice, sorghum, tomato, mango, peach, apple, mangel 
worzel, pear, strawberry, banana, melon, potato, carrot, lettuce, cabbage, onion, soya 
spp, sugar cane, pea, field beans, poplar, grape, citrus, alfalfa, rye, oats, turf and 
forage grasses, flax and oilseed rape, and nut producing plants insofar as they are not 
already specifically mentioned, their progeny, seeds and parts. 

Soybean, canola, brassica, cotton, sugar beet, sunflower, peas, potatoes and mangel 
worzels according to any one of claims 44-48. 

A method of selectively controlling weeds in a field, the field comprising weeds and 
plants or progeny according to any one of claims 44-49, the method comprising 
application to the field of a glyphosate type herbicide in an amount sufficient to 
control the weeds .without substantially affecting the plants. 

A method according to the preceding claim, further comprising application to the 
field either before or after application of the glyphosate herbicide of one or more of 
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the following: a herbicide, insecticide, fungicide, nematicide, bactericide and an 
anti-viral. 

A method of producing plants which are substantially tolerant or substantially 
resistant to glyphosate herbicide, comprising the steps of: 

(i) transforming plant material with the polynucleotide of any one of claims 5 to 
40 or the vector of claim 41 ; 

(ii) selecting the thus transformed material; and 

(iii) regenerating the thus selected material into morphologically normal fertile 
whole plants. 

A method according to the preceding claim, wherein the transformation involves the 
introduction of the polynucleotide into the material by: (i) biolistic bombardment of 
the material with particles coated with the polynucleotide; or (ii) impalement of the 
material on silicon carbide fibres which are coated with a solution comprising the 
polynucleotide; or (iii) introduction of the polynucleotide or vector into 
Agrobacteritcm and co-cultivation of the thus transformed Agrobacterium with plant 
material which is thereby transformed and is subsequently regenerated. 

A method according to the preceding claim, wherein the transformed material is 
selected by its resistance to glyphosate. 

Use of the polynucleotide of any one of claims 5 to 40, or the vector of claim 41, in 
the production of plant tissues and/or morphologically normal fertile whole plants 
which are substantially tolerant or substantially resistant to glyphosate herbicide. 

Use of the polynucleotide of any one of claims 5 to 40, or the vector of claim 41, in 
the production of aherbicidal target for the high throughput in vitro screening of 
potential herbicides. 
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57. A method of selecting biological material transformed so as to express a gene of 
interest, wherein the transformed material comprises the polynucleotide of any one of 
claims 5 to 40 or the vector of claim 41 and wherein the selection comprises exposing 
the transformed material to glyphosate or a salt thereof, and selecting surviving 

5 material. 

58. A method according to the preceding claim, wherein the biological material is of 
plant origin. 



10 59. A method according to the preceding claim, wherein the plant is a dicot 

60. A method according to the preceding claim, wherein the dicot is selected from group 
.consisting of soybean, cotton, sugar beet, Brassica napus, campestris or oleraceae. 

15 61 . A method for regenerating a fertile transformed plant to contain foreign DNA 
comprising the steps of: 

(a) producing regenerable tissue from said plant to be transformed; 

(b) transforming said regenerable tissue with said foreign DNA, wherein said 
foreign DNA comprises a selectable DNA sequence, wherein said sequence functions 

20 in a regenerable tissue as a selection device; 

(c) between about one day to about 60 days after step (b), placing said 
regenerable tissue from step (b) in a medium capable of producing shoots from said 
tissue, wherein said medium may further contain a compound used to select 
regenerable tissue containing said selectable DNA sequence to allow identification or 

25 selection of the transformed regenerated tissue; 

(d) after at least one shoot has formed from the selected tissue of step (c) 
transferring said shoot to a second medium capable of producing roots from said 
shoot to produce a plantlet, wherein the second medium optionally contains the said 
compound; and 

30 (e) growing said plantlet into a fertile transgenic plant wherein the foreign DNA 

is transmitted to progeny plants in Mendelian fashion, wherein between step (b) and 
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step (c) there is an optional step of placing the transfonned material onto callus 
inducing medium, characterised in that the foreign DNA is, or the selectable DNA 
sequence comprised by the foreign DNA comprises, the polynucleotide according to 
any one of cl aims 5 to 40, and the said compound is glyphosate or a salt thereof. 

A method according to the preceding claim, wherein the plant is a dicot selected from 
the group consisting soybean, cotton, sugar beet, Brassica napus, campestris or 
oleraceae. 

A method according to either of claims 61 or 62, wherein the said regenerable tissue 
is selected from the group consisting of embryogenic calli, somatic embryos, 
immature embryos etc. 

A diagnostic kit comprising means for detecting the enzyme of any one of claims 1-4 
or that encoded by the polynucleotide of any one of claims 5 to 40 or the 
polynucleotide or vector of any one of claims 5 to 41. 
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SEQUENCE LISTING 

<110> Syngenta Limited 

<120> HERBICIDE RESISTANT PLANTS 

<130> PPD50594/WO 

<140> PCT/GB01/04131 

<141> 2001-09-14 

<150> GB0023911.1 

<151> 2000-09-29 

<150> GB0027693.1 

<151> 2000-11-13 

<150> GB0023910.3 

<151> 2000-09-29 

<160> 70 

<170> Patentln version 3.1 

<210> 1 

<211> 3836 

<212> DNA 

<213> Brass ica napus 

<400> 1 

gcatgcagat cttaaaggct cttttccagt ctcacctacc aaaactataa gaaaatccac 60 

1 
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ttgctgtctg aaatagccga cgtggataaa gtacttaaga cgtggcacat tattattgac 120 

tactagaaaa aaaactcata caccatcgta ggagttgggg ttggtgaaga atttgatggg 180 

tgcctctccc ccccccactc accaaactca tgttctttgt aaagccgtca ctacaacaac 240 . 

aaaggagacg acagttctat agaaaagctt tcaaattcaa tcaatggcgc aatctagcag 300 

aatctgccat ggcgtgcaga acccatgtgt tatcatctcc aatctctcca aatccaacca 360 

aaacaaatca cctttctccg tctccttgaa gacgcatcag cctcgagctt cttcgtgggg 420 

attgaagaag agtggaacga tgctaaacgg ttctgtaatt cgcccggtta aggtaacagc 480 

ttctgtttcc acgtccgaga aagcttcaga gattgtgctt caaccaatca gagaaatctc 540 

gggtctcatt aagctacccg gatccaaatc tctctccaat cggatcctcc ttcttgccgc 600 

tctatctgag gtacatatac ttgcttagtg ttaggccttt gctgtgagat tttgggaact 660 

atagacaatt tagtaagaat ttatatataa tttttttaaa aaaaatcaga agcctatata 720 

tatttaaatt tttccaaaat ttttggaggt tataggctta tgttacacca ttctagtctg 780 
catctttcgg tttgagactg aagaatttta ttttttaaaa aattattata gggaactact 840 
gtagtggaca acttgttgaa cagtgatgac atcaactaca tgcttgatgc gttgaagaag 900 
ctggggctta acgtggaacg tgacagtgta aacaaccgtg cggttgttga aggatgcggt 960 

ggaatattcc cagcttcctt agattccaag agtgatattg agttgtacct tgggaatgca 1020 

ggaacagcca tgcgtccact caccgctgca gttacagctg caggtggcaa cgcgaggtaa 1080 

ggttaacgag ttttttgtta ttgtcaagaa attgatcttg tgtttgatgc ttttagtttg 1140 

gtttgttttc tagttatgta cttgatgggg tgcctagaat gagggaaaga cctataggag 1200 

atttggttgt tggtcttaag cagcttggtg ctgatgttga gtgtactctt ggcactaact 1260 

gtcctcctgt .tcgtgtcaat gctaatggtg gccttcccgg tggaaaggtg atcttcacat 1320 

ttactctatg aattgtttgc agcagtcttt gttcatcaca gcctttgctt cacattattt 1380 

catcttttag tttgttgtta tattacttga tggatcttta aaaaggaatt gggtctggtg 1440 
tgaaagtgat tagcaatctt tctcgattcc ttgcagggcc gtgggcatta ctaagtgaaa 1500 
cattagccta ttaaccccca aaatttttga aaaaaattta gtatatggcc ccaaaatagt 1560 
ttttaaaaaa ttagaaaaac ttttaataaa tcgtctacag tcccaaaaat cttagagccg 1620 
gccctgcttg tatggtttct cgattgatat attagactat gttttgaatt ttcaggtgaa 1680 
gctttctgga tcgatcagta gtcagtactt gactgccctc ctcatggcag ctcctttagc 1740 
tcttggagac gtggagattg agatcattga taaactgata tctgttccat atgttgaaat 1800 
gacattgaag ttgatggagc gttttggtgt tagtgccgag catagtgata gctgggatcg 1860 
tttctttgtc aagggcggtc agaaatacaa gtaatgagtt cttttaagtt gagagttaga 1920 

2 
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ttgaagaatg aatgactgat taaccaaatg gcaaaactga ttcaggtcgc ctggtaatgc 1980 

ttatgtagaa ggtgatgctt ctagtgctag ctacttcttg gctggtgctg ccattactgg 2040 

tgaaactgtt actgtcgaag gttgtggaac aactagcctc caggtagttt atccactctg 2100 

aatcatcaaa tattatactc cctccgtttt atgttaagtg tcattagctt ttaaattttg 2160 

tttcattaaa agtgtcattt tacattttca atgcatatat taaataaatt ttccagtttt 2220 

tactaattca ttaattagca aaatcaaaca aaaattatat taaataatgt aaaattcgta 2280 

atttgtgtgc aaatacctta aaccttatga aacggaaacc ttatgaaaca gagggagtac 2340 

taattttata ataaaatttg attagttcaa agttgtgtat aacatgtttt gtaagaatct 2400 

aagctcattc tctttttatt ttttgtgatg aatcccaaag ggagatgtga aattcgcaga 2460 

ggttcttgag aaaatgggat gtaaagtgtc atggacagag aacagtgtga ctgtgactgg 2520 

accatcaaga gatgcttttg gaatgaggca cttgcgtgct gttgatgtca acatgaacaa 2580 

aatgcctgat gtagccatga ctctagccgt tgttgctctc tttgccgatg gtccaaccac 2640 

catcagagat ggtaaagcaa aaccctctct ttgaatcagc gtgttttaaa agattcatgg 2700 

ttgcttaaac tctatttggt caatgtagtg gctagctgga gagttaagga gacagagagg 2 760 

atgattgcca tttgcacaga gcttagaaag gtaagtttcc ttttctctca tgctctctca 2820 

ttcgaagtta atcgttgcat aactttttgc ggtttttttt tttgcgttca gcttggagct 2880 

acagtggaag aaggttcaga ttattgtgtg ataactccac cagcaaaggt gaaaccggcg 2940 

gagattgata cgtatgatga tcatagaatg gcgatggcgt tctcgcttgc agcttgtgct 3000 

gatgttccag tcaccatcaa ggatcctggc tgcaccagga agactttccc tgactacttc 3060 

caagtccttg aaagtatcac aaagcattaa aagacccttt cctctgatcc aaatgtgaga 3120 

atctgttgct ttctctttgt tgccactgta acatttatta gaagaacaaa gtgtgtgtgt 3180 

taagagtgtg tttgcttgta atgaactgag tgagatgcaa tcgttgaatc agttttgggc 3240 

cttaataaag ggtttaggaa gctgcagcga gatgattgtt tttggtcgat catctttgaa 3300 

aatgtgtttg tttgagtaat ttttctaggg ttgagttgat tacactaaga aacacttttt 3360 

gattttctat tacacctata gacacttctt acatgtgaca cactttgttg ttggcaagca 3420 

acagattgtg gacaattttg cctttaatgg aaaggacaca gttgtggatg ggtgatttgt 3480 

ggacgattca tgtgtggtta ggtgatttgt ggacggatga tgtgtagatg agtgatgagt 3540 
aatgtgtgaa tatgtgatgt taatgtgttt atagtagata agtggacaaa ctctctgttt 3600 
tgattccata aaactataca acaatacgtg gacatggact catgttacta aaattatacc 3660 
gtaaaacgtg gacacggact ctgtatctcc aatacaaaca cttggcttct tcagctcaat 372 0 
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tgataaatta tctgcagtta aacttcaatc aagatgagaa agagatgata ttgtgaatat 3780 
garscggaga gagaaatcga agaagcgttt accttttgtc ggagagtaat gaattc 3B36 

<210> 2 

<211> 31 

<212> DNA 

<213> Primer 



<400> 2 31 
gcatgcagat cttaaaggct cttttccagt c 



<210> 3 

<211> 32 

<212> DNA 

<213> Primer 



<400> 3 32 
gaattcatta ctctccgaca aaaggtaaac gc 



<210> 4 

<211> 20 

<212> DNA 

<213> Primer 



<400> 4 20 
gcgcaatcta gcagaatctg 



<210> 5 

<211> 20 

<212> DNA 

<213> Primer 



<400> 5 20 
tccttcttgc cgctctatct 



4 
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<210> 6 

<211> 20 

<212> DNA 

<213> Primer 

<400> 6 

gcgttgaaga agctggggct 

<210> 7 

<211> 20 

<212> DNA 

<213> Primer 

<400> 7 

acctatagga gatttggttg 

<210> 8 

<211> 20 

<212> DNA 

<213> Primer 

<400> 8 

actaagtgaa acattagcct 

<210> 9 

<211> 20 

<212> DNA 

<213> Primer 

<400> 9 

gttccatatg ttgaaatgac 

<210> 10 
<211> 20 
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<212> DNA 
<213> Primer 

<400> 10 

tccactctga atcatcaaat 

<210> 11 

<211> 20 

<212> DNA 

<213> Primer 

<400> 11 

gtaagaatct aagctcattc 

<210> 12 

<211> 20 

<212> DNA 

<213> Primer 

<400> 12 

tcatggttgc ttaaactcta 

<210> 13 

<211> 20 

<212> DNA 

<213> Primer 

<400> 13 

cagcttgtgc tgatgttcca 

<210> 14 

<211> 20 

<212> DNA 

<213> Primer 
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<400> 14 

cgatcatctt tgaaaatgtg 



20 



<210> 15 

<211> 20 

<212> DNA 

<213> Primer 

<400> 15 

ctctctgttt tgattccata 20 

<210> 16 

<211> 25 

<212> DNA 

<213> Primer 



<210> 17 

<211> 29 

<212> DNA 

<213> Primer 

<400> 17 

ggtgagtgaa cgcatggcta ttcctgcat 2 9 

<210> 18 

<211> 29 

<212> DNA 

<213> Primer 



<400> 16 

cgcatcagcc tcgagcttct tcgtg 



25 



<400> 18 

atgcaggaat agccatgcgt tcactcacc 



29 



7 
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<210> 19 

<211> 25 

<212> DNA 

<213> Primer 



<400> 19 

catttcaaca tatggaacag at ate 25 



<210> 20 

<211> 90 

<212> DNA 

<213> Primer 



<400> 20 

ccegggatat tggaagttaa aggaaaagag agaaagagaa atctttctgt ctaagtgtaa 60 
ttaaccatgg cgcaatctag cagaatctgc 90 



<210> 21 

<211> 20 

<212> DNA 

<213> Primer 



<400> 21 

ggacgtggaa acagaagctg 

<210> 22 

<211> 20 

<212> DNA 

<213> Primer 



<400> 22 

tctcaaaccg aaagatgcag 20 



<210> 23 
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<211> 31 
<212> DNA 
<213> Primer 



<400> 23 

ggttgaagca caatctctga agcttccatg g 31 



<210> 24 

<211> 31 

<212> DNA 

<213> Primer 



<400> 24 

gatcttgggt cattttaatg ggccttctaa c 31 



<210> 25 

<211> 34 

<212> DNA 

<213> Primer 



<400> 25 

aaataaaccc attaaccact aatgaaaaag ggat 34 



<210> 26 

<211> 28 

<212> DNA 

<213> Primer 



<400> 26 

gcggagctcg agacttttca acaaaggg 28 



<210> 27 
<211> 35 
<212> DNA 
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<213> Primer 



<400> 27 

atcccgggaa gcatgcgaag gatagtggga ttgtg 35 



<210> 28 

<211> 23 

<212> DNA 

<213> Primer 



<400> 28 

cgcccgggat tgaatttbaa age 23 



<210> 29 

<211> 209 

<212> DNA 

<213> Enhancer 

<400> 29 

tctagagcag ctggcttgtg gggaccagac aaaaaaggaa tggtgcagaa ttgttaggcg 60 

caret accaa aagcatcttt gectttattg caaagataaa gcagattcct ctagtacaag 120 

tggggaacaa aataacgtgg aaaagagctg tcctgacagc ccactcacta atgcgtatga 180 

egaaegcagt gacgaccaca aaagagctc 209 

<210> 30 

<211> 309 

<212> DNA 

<213> Enhancer 

<400> 30 

tgagactttt caacaaaggg taatatcegg aaacctcctc ggattccatt gcccagctat 60 

ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc tcctacaaat gecatcattg 120 

cgataaagga aaggecateg ttgaagatgc ctctgccgac agtggtccca aagatggacc 180 

cccacccacg aggagcatcg tggaaaaaga agaegttcca accacgtctt caaagcaagt 240 



10 
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ggattgatgt gatatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca 300 
agacccttc 309 

<210> 31 

<211> 15941 

<212> DNA 

<213> Glycine max 



<400> 31 
aaaaaatacc 


ttttagtaaa ataagagcgt ragatcaatc agacgttttc tctttgggac 


60 


ttctcattct 


taattgaatt gacaataact aaagtgaaac taaggctaaa atcaactcgc 


120 


ctagtcaagc 


tcgtccacaa aaataggttt ttgaaagttt accatgtcag tttcttacta 


180 


agtaaaaaag 


gatcatttgt aaggtccaac gccttaaaat gatcaccctt caaagtaaaa 


240 


oaatcacttcr 


tttcacgcat aagtaagaac tacgtaggtc tgatttcctc tccaaaggag 


300 


qq t a c ci t acjcr 


agcaaaagcc ccgcttttgt cgaccttaaa aaataaaaag aaacaaaagt 


360 


taaggtaaca 


caatttccac aattctagaa ataaggttgt tgtctttcga gacaaacgta 


420 


agaggtgcta 


ataccttcct caaacgtaaa tacaactccc gaacttagaa ttttcatttt 


480 


gaccggtttc 


cttcggtttt cccgacgttt tccacaaata aacgttggtg gcgactccgc 


540 


gcatctttcc 


tcctttggaa agcgcaccca tgagcctcgc cctcgctcgc ccgcgaaggg 


600 


cacgttgcga 


cactaggtta ataagaagtt tagtccacat atctatggac ctcttaaggt 


660 


gattatcaaa 


gtggaagtgg tgacctacat gttggcccta ccagcccaT-u cguaLatttd 


79 0 


cccccatcaa 


ccatgtatct cagctgaaac atgctatcaa tccaaatgtt atcagaggaa 


780 


catgagctgc 


atgttcaacc agaagtagtt ctagtcttaa tacaaatgtc aaatggcaca 


840 


agtgagattt 


tgaatttctg atgttgtaaa aatctcagga catgaatact attgggaagc 


900 


aattattcat 


acttcaccaa tccaaactga cccaaaattc tcaaatcaca tgaaagcaaa 


960 


aatgcatata 


acacgaagaa taagaagaag aggaactaac ctggggtttc gatgattaaa 


1020 


gcgttgttgt 


tgatgatgaa aacgatgatt atggagagaa attgttgttg aatggtgaaa 


1080 


ttgttataga 


aagagaacga agatagagaa aaagatatat agatttttca aggctcaaac 


1140 


cctaaaatca 


ccatgagaga gaacaaagat tgagaaacct acaaccacta tgagagagaa 


1200 


tgagcagaac 


agaagcgtga gatagagaac gagagttaag gtgcgagagg acacgaagaa 


1260 


caaaaggtgt 


gagagaaaga acaaaggagc ctacggtgtg agatgagaga atttgaaatt 


1320 


cttaccattt 


aggtggaatt tcaattctac aattttattc tattaaaatt attttaaaaa 
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atgatgtcat tttaaattct ttaaaatctc atatccaaca actgaattat gatagaggta 1440 

tttcaaattc acttaaaaaa attatcttat ttaaataccc catccaaaca tagcgaaatg 1500 

ttcatgagaa ggatcaagtg gtttggaaac atagtactaa tggtgtttat acagttcatg 1560 

gaatccttga tagataattt aaaggttgct ggaaattgga tgaaggtgtg gagattaaat 1620 

attcttccaa aaataaagcg ttttatttgg agagtgttgt gtggttgtct cccctgtagg 1680 

caaaagcttc gatgtaaagg agttcaatgt ccaataacct atgctttcta tccctcgatt 1740 

attgaaaatg aatgacacat tttatttggt tgaaatcaag aaataagcat gtggcaagca 1800 

acgggtattt gacaattcat agaacaaaag gtgaatgcag caaaagaatt aatgaactcc 1860 

ttttcgatct acttggatca ctacatggag atattatcaa caaatttgat gttactttat 1920 

ggagcagttg gaattcttgg aatgacaaga tatgaaatga acataccaac cctcctcttg 1980 

tttctgtttc ggtttctatg cagtattttg ttgaatggca aagtgcaagg taatatgctc 2040 

ctcaacatca attaacaaat gttcatgaca tctcttacca gctccaactt ggggacgttt 2100 

gacaaacacc accgtcaagt ttccttaaat gcaacattaa tgttgctcat ttcaaggagg 2160 

agaatagttt tggtgtcggc atgatactcc atcaaggaag attcgtcaaa gctcactcac 2220 

gttttcgaca tgggtcgaca tgggttacct gacccaaagg ctgaggctta ggcttgggtt 22 80 

tgcttcaagt attgatctgg gcccagacta ttggtttaca taatatcatt tttgaaaacc 2340 

taacatctaa aactcaaggt tgtttagagg tgcgccattc caaaataaga ttatcctatt 2400 

tgtgcatgaa tgcgaccaac tatctcctgt ttcagcatta taaagtataa acaacaaact 2460 

tctttaatca agggactaaa agatattgga catacaagct aaaagtgata gaatttgaga 2520 

aaacaaatat tgacaacaat attcaagagg acactaaaac ataattctca aatttttttt 2580 

gtttatttaa aataaagtgg ttcattaggt agctcttgaa aaaaaaaagt gattaacatg 2640 

ctagtacatg ttaaaaaaag tgaataatat tctattataa gtaaagagga ataatttatc 2700 

gatgtctcat ttttgagata gttcaccgaa gccaattttt ttactttttt ctcttcaatc 2760 

aattttattt tgttaatagt tgtcattgtg gatggattat aataattttg aaatgattta 2820 
aaatgagata tatattaata aatttaatta attcataaat atgttactaa atataaattt 2880 
catggtatac tcaggacggt tataacggaa. atcaaataaa taataaaata gaataatatt 2940 
aatttgcagt aaaaagaata atattaattt attctgttta cactatgtat aacgaaatta 3 000 
aatatattca tttaatatga ttttattttt ctttaattta aaaacatgaa ttatttgttt 3 060 
cttattttat taccattttg ttcacttctc aattaataaa ataaaaatat attctacaat 3120 
tatagtaaaa aggtttaatt ttatataatg ttagcataca gatttttctc tattaactaa 3180 
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ttaattaaaa attatcctta ttttaaggaa attaaaaatt atcatatata tataagtttt 3240 

aaattaatta tcttatatat gtaccaaaaa gttttaaagc aattattata aaaattaata 3300 

aatttatcat ataaaataat ttataattaa attttaaatt atcaattcat taaattaaat 3360 

tatttaaaat ttttgaatga taatataata attttatcct ctactaagtc ccaacgtttc 3420 

ctattttatt ccacttttag caataaattt tgtcataaac acttataaca aaaaaagtaa 3480 

gtaaaaaata aaaaaaagtt tttcaataaa gtataaacta atttgtataa acttttagaa 3540 

aaaataaagt tatacattga taatataaat tttttacata attatccgat caactcatta 3600 

tatatgataa atttattgat tttttaaaat aattatctta aaataattta aacaatgatt 3660 

tgcaattaga tgataatata aaattatttt acacactaca tgtattaaac tcaaactttt 3720 

atatattagt ttttctaaaa actaattttt aactcaaaaa aaatgttact tataattttc 3780 

ttatcttctt tttttataag tattttttaa gaaatttatt gaaacatgac catgcttggg 3840 

tcaataatac tactctctta gacaccaaac aacccttccc aaactataat ctaatccaaa 3900 

agccatcatt cattttcctt ggtaggtaaa gttccaagac cttcaccaac tttttcactc 3960 

aattgttttg gtgtaagcaa ttcgacatgt gttagtgtta gttggcaacc aaaaatccct 4020 

ttatgtgact caatccaaca accactcaca ccaccaaccc ccataaccat ttctcacaat 40 80 
acccttcatt tacacattat catcaccaaa aataaataaa aaaaacctct catttcagag 4140 
agagagagag agacttcaca gaccaaagtg cagagaacaa caaagttcac aactttaagg 4200 
aaaattgaaa tggcccaagt gagcagagtg cacaatcttg ctcaaagcac tcaaattttt 4260 
ggccattctt ccaactccaa caaactcaaa tcggtgaatt cggtttcatt gaggccacgc 4320 
ctttgggggg cctcaaaatc tcgcatcccg atgcataaaa atggaagctt tatgggaaat 4380 
tttaatgtgg ggaagggaaa ttccggcgtg tttaaggttt ctgcatcggt cgccgccgca 4440 
gagaagccgt caacgtcgcc ggagatcgtg ttggaaccca tcaaagactt ctcgggtacc 4500 
atcacattgc cagggtccaa gtctctgtcc aatcgaattt tgcttcttgc tgctctctct 4560 
gaggtgaagt ttatttattt atttatttgt ttgtttgttg ttgggtgtgg gaataggagt 4620 
ttgatgtgta gagtggattt tgaatatttg attttttttt gtattattct gtgaaaatga 4680 
agcatcatgt cccatgaaag aaatggacac gaaattaagt ggcttatgat gtgaaatgag 4740 
gatagaaatg tgtgtagggt tttttaatgg gtagcaataa gcatattcaa tatctggatt 4800 
gatttggacg tttctgtata aaggagtatg ctagcaatgt gttaatgtat ggcttgctaa 4860 
aatactccta aaaatcaagt gggagtagta tacatatcta cagcaaatgt attaggtgag 4920 
gcatttggct tctctattgt aaggaacaaa taatatcagt taatgtgaaa atcaatggtt 4980 
gatattccaa tacattcatg atgtgttatt tatatgtacc taatattgac tgttgttttt 5040 
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ctccgcaatg 


accaagatta 


tttattttat 


ttagagaagt 


tggacccatt 


aggtgagagc 


tgagtcttga 


ttatccaagt 


atggttgtat 


cccccaaaac 


acaaattaga 


atgacatttc 


atttttacaa 


cttttagtta 


cttaacagat 


tgtttttttt 


tagggaacaa 


ctgttgtaga 


catgcttggt 


gcattaagga 


cccttggact 


agcaattgtt 


gaaggctgtg 


ggggattgtt 


caatttattc 


cttggaaatg 


ctggtactgc 


tgcaggtgga 


aatgcaaggt 


ctgttttttt 


cctcgtataa 


ctaatcacaa 


cagagtacgt 


catccagatt 


agtcatcctt 


tcttcttaaa 


aatatttaaa 


ttagcttgtc 


aaagtctagc 


tgcacctgag 


ggtgttcctc 


atgatctact 


atgattactg 


gtttgagtct 


ctaagtagac 


tacttgatgg 


ggtgccccga 


atgagagaga 


agcaacttgg 


tgcagatgtt 


gattgctttc 


atgggaaggg 


aggacttcct 


ggcggaaagg 


gagtaacttt 


cctggatcaa 


aattctaatt 


aataagacta 


agggtaggtt 


taggagttgg 


ttttttcttc 


ttctaataaa 


tattctttaa 


aagatagatt 


agcataactt 


aatgttttaa 


cctgtctatt 


taaaaatcaa 


atatttgtcc 


ccaaatatac 


cgtagttgaa 


ttatattttg 


tcagagatta 


tccctcacct 


ttattatagc 


attcttgttt 


taaaaaacac 


taagttttct 


ccaaagtaaa 


tattgtagca 


agtgttggtt 


acatgcaggt 


gaaactgtct 


ggatcagtta 


cagctccttt 


agctcttggt 


gatgtggaaa 


catatgttga 


aatgactctg 


aagttgatgg 
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cctctaaagt gactaattga gttgcttact 5100 

gtggggggaa ctaatcttga atatacaatc 5160 

gaacaatgtt agctctagaa gataaaccct 5220 

aagttccatg tatgtcactt tcattctatt 5280 

gtcttgttca gcataaatta taatttattc 5340 

caacttgttg tatagtgagg atattcatta 5400 

gcgtgtggaa gatgacaaaa caaccaaaca 5460 

tcccactagt aaggaatcta aagatgaaat 5520 

aatgcgtcct ttgacagcag ctgtggttgc 5580 

ttttttgttc agcataatct ttgaattgtt 5640 

gttcttcttc ctgttataat ctaaaaatct 5700 

aggaaccttt aattatcaat gtatttattt 5760 

atatacatat tttgattata ttctgagaaa 5820 

tcaacctctg ttattattag attttctatc 5880 

catcttgatg ttcaaaatat ttcagctacg 5940 

ggccaattgg ggatttggtt gctggtctta 6000 

ttggcacaaa ctgtccacct gttcgtgtaa 6060 

tatggtttgg atttcattta gaataaggtg 612 0 

taagaagcct ccctgttttc ctctctttag 6180 

gttttggaga gaaatggaag ggagagcaat 6240 

tttgatacat tttttaagta aaagaatata 6300 

tcttttattt atttttataa atattatata 6360 

tccattccct ttcccttcaa aacctcagtt 642 0 

gaaggcctat tggttggaga cttttccttt 6480 

ctttctattt ttaaacttca tatagacgcc 6540 

tttagttata ccttcccctc cttattctct 6600 

gatccattgt gaatcttatt atcctattta 6660 

gcagtcaata cttgactgct ttgcttatgg 672 0 

ttgagattgt tgataaactg atttctgttc 6780 

agcgttttgg agtttctgtg gaacacagtg 6840 
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gtaattggga taggttcttg gtccatggag gtcaaaagta caagtaggtt tctatgtttt 6 900 

agtgttacat cacttttagt atccaaaatg caatgaaatt aaaactcatg tttcctatta 6960 

ggtctcctgg caatgctttt gttgaaggtg atgcttcaag tgccagttat ttactagctg 7020 

gtgcagcaat tactggtggg actatcactg ttaatggctg tggcacaagc agtttacagg 7080 

tattcatgaa tgacactatc ttctattatc ttttatttta tagcttacac tatcattcct 7140 

aattcaaccc ccttcttttc tctttcccct cactcccaaa atagagctta tcattgtatc 7200 

cttaatagct cttttatgtg ctgtttcatc tcttgagcag attctacatg attatccttt 7260 

ttagtgaact tttttgtctt ttaaataatt gatatcatca tttgaacttg cacatcatct 7320 

ctatccttcc tcattttatt atatttcatt aagagtgata aaattacatt gtcaagtaaa 7380 

gcaagttagc gtatcagata atgatgcttc gattgcagaa ttggtgaaag cttgcacaat 7440 

tcattagctt agtgacaatt gtgatgtaga tagccataat cttccatgca acttattatg 7500 

ctttttgtct aaaacttctt ttgcacttaa acatgaatta gagagttagg atgaacaata 7560 

gtaacagaca caaagcttat ttctagacca cccagtggtg aagacgaagc agcatgagtg 7620 

agattaatgg cagcatctga ttgaagcttc ttttggtatt attggagatg aaatcttgag 7680 

ccaaattagg gcttttgcct cctcaatagc cattggttca aaattgcttt caataaaatg 7740 

attacagaat caaattctta agaaagacct tcaagaacaa catcaagatg tttcctgtgc 7800 

aaaacaggct cacctacaaa agagtgtcta caagcacctt tacacttgca aaaaattcct 7860 

taactgaaca attgccaaga gtcatgttgc aaacttgaga ttggagttga ttagatttgg 7920 

tgtgtgtgag attctggaaa tgagcatgaa gttttttgca aagctcataa aatgaatgca 7980 

gccaagaatc ctagatagga ttgaatttga gagagaggct tgaagtcaag ataaccgcaa 8040 

ctgatcttgc tgttcccatg ctgcatagga ctcgttgatt ctaccaagat aacaatcttc 8100 
tgtggtgaga atatgagtgg gaattgtcag gcaacaaaga aaatgaaggc tatgagcctt 8160 
tgttaaggtg cagagagaaa acagaaaatg aagaacttgt gtattgaagg aattgaaaag 8220 
ccaattacaa ttgtgttgca gagagatatt tatagttctg aagtgtataa ctaactaaac 8280 
agaatgctac caacctagag gcagttgaga attgctccct tattacaata ctctcacagc 8340 
cttgatgata ggcttgattt gctgccaccg aagaagaaca ttattgttgg agagtttttc 8400 
agtgattgca tcagagaaag ttacaagtgt gattggagat gaaggacttg cggtcgtcga 8460 
aggttaagtg ttaagtggtg gaatgtgtat cggaggtggt tgcctagccc aagaagctgg 8520 
ttgcagtgga agcggatgcg gcagaagcca tgggttcaga tcagaactat gttctagata 8580 
ccatattagg gttagctttc aagagaaaac ttatgaaact gaactgtaaa ctgttgtatt 8640 
attgattagc tgatctagta gtgatggaag cttgcttgcg aagcttctat ggaggctgga 8700 
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tcattgagct tcaatgaggt ccttcaatgg tgatttttca ccatggagat gcagcggaag 8760 

ataaatgaag aggtgagagg aggcgtcatc cactaggaaa taaactatgg aagaaggagc 8820 

ttcaccacca agagagtgtc ttggataaga agcttagaga ggaagcttca atggaggaat 8880 

agaaagagag agggggaaca aaaaattgaa ggaggaaaag agggagagaa gttgaacttt 8940 

gaagtgtgtc tcacaagact cattcattaa agttacaaca agtgttacac atgcttctat 9000 

ttatagtcta ggtaacttcc ttgagaagct agaacttaac tacacacacc tctctaataa 9060 

ctaagataac ctccttgaga aatttccttg agaagcttcc ttaagaagat tcctagagaa 9120 

gctaaagctt agttacacac acctctctaa taactaagtt cacctccttg agatgagaag 9180 

ctagagcctt agctacacac acccttataa tagctaaact cactctattc caaaatacat 9240 

gaaaatacaa aaaagttcct actacaaaga ttactcaaaa tgtcctgaaa tacaagacta 9300 

aaactctata ctactagaat ggtcaaaata caaggtccaa aagaaggaaa aacctattct 9360 

aatatttaca aagagagtgg acccaacctt agtccatggg ctcagaaatc taccctgagg 9420 

ttcatgggaa tcctagggtc ttctttagta gctctagtcc aattttcttg gagtcttcta 9480 

tccaatatcc ttgggggtag gattgcatca tgtagtacaa gagaggggaa tatatataca 9540 

gttgtaagtt gtaactacct aatcaaaccc aactgactca ccatacacag aaaaacaatg 9600 

tacaaaggat aatagaaaaa ctgaaactga gatatactca ttacatagaa gaattacatg 9660 

atgaatctga aaattttaat tgggaaactg aggacaataa caaggctttc agagttcatt 9720 

ttgacaatag gtaaattatg gacttcaact tagacctcaa gggtatgctt gggagtgggg 9780 

ttttagagga aaaggggaag gagggaaatt ttttaattct agtaaatatt ctctattttg 9840 

attaattttt aaaataaaac aaataaactt atagattagc attcacttaa tgtcctgatc 9900 

ttttatttgt ttcatgaagt tttttatagc ctttctgttt ctgttaaaag aaattggagg 9960 

gtgttcataa tctatatgcc catatatatg tattgtcctg ttgtctgtgt ttctgttttc 10020 

tttctttagt gtctcgcctt tccttaattc cttatcaaca ccttgtttat ttttgaaaac 10080 

ttaaatgtcc tcattttttt tacctgtact ataactttcc aaacaaatag tagaaaatgt 10140 

ttcaaatatt ccccagaact gtttcagtga acaaagttca agccagaaaa aaagcaagca 10200 

gaggctaaag aagtatatct tattgtgaag agtaatctca attggttaat tatcgaattg 10260 

tatcatttta caatagaaaa ttagaagtgg gttgattagt ttatacgaac aattatttaa 10320 

atagggttga tggtgtggtt aattactgat catatgttaa gactatgagt cgatgaccag 10380 

cctcagtgca gtcattttaa gttccattat tttttaattg tgcttatgtt taactttgtc 10440 

cttcttttaa agggagatgt aaaatttgct gaagttcttg aaaagatggg agctaaggtt 10500 
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acatggtcag agaacagtgt cactgtttct ggaccaccac gagatttttc tggtcgaaaa 10560 
gtcttgcgag gcattgatgt caatatgaac aagatgccag atgttgccat gacacttgct 10620 
gttgttgcac tatttgctaa tggtcccact gctataagag atggtatggt ttagctgttt 10680 
gtttttgctc aagagcttgt tttcacatat ttgtggtcaa gagcggttta gctgtttttt 10740 
tttttaaatt gtggttaatg aaattattat aattatacta atattgaaat tggatcacta 10800 
accatgtgaa atgcttgaat ttttagcttg aaaacttctt ttgtggatga tttttctttt 10860 
atctatgtag tgtggggcat ctatgttgtt cctttaagcf ttgtgtcata gttgtacaac 10920 
ctggtctgtg ggctggtcca gttcttactc atgactgtct tttgcagcaa actggttaaa 10980 
cttgggtgac ctgttgaccc acccagattg gcctgttctt ttttaaataa aactccaaaa 11040 
ggctacttat aagatttcaa catgatactt gaatgaccac ccttacttta aaggtcatta 11100 
tgccatcaga ccccttgatt ctctgttcag ttatgttctt attcttatta ttttttaatt 11160 
tatattatta tccaacgatc tgatctttgt acatgatgct ggacaatcat caaagctagt 11220 
ttggttacat catataatac cttttttttc ataaattatt ttttcccata atatatttat 11280 
atgtattttt ttaatacaat attctgggga aatttttttt ttctgttttc ttttttctgg 11340 
tttgaatact atttcaatga ctgggcctta ttattttctg gtttgggtta tacaactgtc 11400 
ctttgatgta cttttgaatg cttcaatagt atttttcata atcatggaag gcactagagt 11460 
gcagtgcata ttgcacaagg ttctttaatg gcatcttgaa tttgttagaa catggagtta 11520 
tgatatacaa ccattcctat gtcttcatct aatagctaaa gcttttggcg tagttggtta 11580 
acatggtatt ggaatcttat tttaaaggtc catcttattc gcaagtacaa ggcaggtgga 11640 
cctgcgcatt attcacgtgc caggctcaaa gggattttgt atgaggggga atttttggaa 11700 
tataatatat agaatcattc atgtggttca cctatcatgt gtcaactcaa gtttttgtcc 11760 
tagttgtatt attgcagtgt ctaaactagg catgatcttc tgtggaagga aatttaacct 11820 
aaggaacttg aatgcttctt gttctgagat ctttattttt tttctccaat ctttttaata 11880 
cttgaaaggt tttagttgta tctataaagg gtgcagaatt agggggtcag cttttctgtt 11940 
gtaagcgtga attaagtgca ttttatgttg ttccctattt tatacatttg aactatcaac 12 000 
catatcattt agcaaacttc tttgttgctc agattcccat ttattcttgt ttcctctatt 12060 
ttttagtggc aagttggaga gttaaagaga ctgagaggat gatagcaatc tgcacagaac 12120 
tcagaaaggt cttgctaatt cctttatggt ttctatatta ctggactttt tacatctcac 12180 
tgacctacta ctgtcttggt atttcagcta ggagcaacag ttgaagaagg tcctgattac 12240 
tgtgtgatta ctccacctga gaaattgaat gtcacagcta tagacacata tgatgaccac 12300 
agaatggcca tggcattctc tcttgctgct tgtggggatg ttccagtaac catcaaggat 12360 
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cctggttgca ccaggaagac atttcctgac tactttgaag tccttgagag gttaacaaag 12420 
cactaagcat tttgtacatt gagtagggaa gagagagaga tatatataaa tactcacaaa 12480 
gcagtgagta tatggcttgc ttttgtttct aaacattcct ttataaggac ttgagataat 12540 
ttgtattgtt taaaagagcc attggttctg ttgtatcaaa gtaatttaat tttcagtacc 12600 
gacttggctc tactcatatg tggtggtgta cattgcatct ttgcatttga ttccatatct 12660 
attgggatgg gaccatacat tttttgttta ttatttaagt caaacatatc agataatatg 12720 
tggtgaccag aaaagtgtct agcaatcaaa tgggtctgtg attgcatgtt tgagttcagg 12780 
acatttatta atgagtgaca gaattttaaa aagctcgcca ataggagttg ccacatctga 12840 
tcgagtcttc tgtaaacact aactgtaact tgagtttgat aatcagattc cccccgttgg 12900 
taggtaatat gtaaatttca ctatagattc ttactgagga tgatggtctt gacttcgaag 12960 
tggtgagtgt aattctctaa tctaaacttt atatatatat aaaaaaagaa tgttaaaaac 13020 
ttgagtgaca tacagcaaat agcaatgaaa gcatgaccat ggccttgggg ggccaaatgg 13080 
ccatgtacca cttttccacc tacttgagag ccatggtcct gaatgaggtc cttttgtttc 13140 
atttttttta caataaagaa agttcatatt cttggcaata gcaaggcaaa tcatttgtgg 13200 
aagtcacatg aaattcacca taggcttcat tgttcaaatc ctttagtcac taaggcaggg 13260 
cctatgtaag aattttgact actcttgtaa atgctgtgcg gtaaagacta cgggaaaata 13320 
taaattattg catatctaga ggtgaggttc atttctccga tccatttctc caagtaataa 13380 
aaatatatat aaattattaa tatggtatga tatggttatt tgtcctttaa atatgcgatt 13440 
ataggttcaa tctttggtta atctatcaaa tgcatctcaa tattttgaga gaatagtttt 13500 
tagctcttga ctgagagata ctttgagtta aaaaaatgta tttataaact attgtctatt 13560 
gaatttttat tttttttgta acaccacatg tgagtatttt actaaatcca tgttacaatg 13620 
tactgaatgt tttacaatac agtctggttt gataataaaa tatgcacaaa gtccagctta 13680 
tgtaactctt tcaattcatg gaaaatgatg atatgcgaat tgaaagaaag aaaaccataa 1374 0 
tgtaagtgta tttttttttt ctttctcttc tttaatatta ctcaatatca gattgcacat 13 800 
gggaagggta ctggctcatg tgggatatta atttttatcc cttacctaaa aagagagaaa 13 860 
gagaaaataa ctatttaaaa agagtaccac caccaccttt ttcgctttta agttttgttc 13 920 
tgttttcact actctttcat gcgttgtttg ttggcttccg atagtgacga atatgataat 13 980 
gattttacgt tgagaaaaaa aatgtgccat gtaaattcca gagattgaaa ttttagcttc 14040 
aaattaatat ttattttatt atcttttcaa attggttgca tgtgaaagaa tttgaagttc 14100 
ttcaacaagg aaataaacac ttcaatgctc aatccttaga gtcagtcact agatcccaaa 14160 
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agaatgtgat 


gtttacacac 


attagtatat 


taatgagtga 


tgcgcgagga 


cggtggtgac 


aatttttttt 


tctttaagta 


ttactctatc 


cttgcgaacg 


aagcaatctt 


gtcctgatcc 


acctaagcac 


atgatacttt 


gctgccctaa 


gatgcttttg 


ccttcatttt 


caagtttgtt 


ctctttcctt 


tctgggaccc 


cttgtcttta 


taattagtac 


tacttgggtt 


ccattaaatt 


ttggatcaac 


tgcatataca 


tatgttttat 


agttattaaa 


acataatgta 


gagccttgtc 


gctttgattt 


tgattgtacg 


ttttttgttt 


tgtcttggag 


tgctgtgatc 


tttagggtgc 


actaatacta 


gtagattcca 


cgcattcaac 


atggtaagaa 


aaggggagga 


gaaaagtaag 


cttttccatg 


acgtaatgaa 


aacattaatt 


tatgtcaaag 


aaacttagga 


ttttttctct 


aatgattaat 


ttgcgttgtt 


gaattcgttg 


gaaagttata 


tcaaagtatc 


atatactaga 


gtaatgcatg 


caattaatgg 


gcaaataaat 


ttcctccaaa 


acctctcaag 


ttcgcattga 


aaatctaaac 


catgtttgtt 


tcatcttgta 


gaatttagtt 


gttattgacc 


gttaaattta 


attgctttta 


ttcaacgcaa 


attctagtag 


cggtttgatt 


ctctacaaac 


ttgaattgag 


tcttaattaa 


ctctataatc 


aaacctttta 


agacaatcaa 


aatttaaact 


cacacgaatc 


gggagtccct 


taatagcaaa 


agcctcaaaa 


attcaacaaa 


ccacccatta 


ttatgttccc 


cacccataat 


taacccaaaa 


ggaagtttcc 


cctctataaa 


tctattttaa 


aaaaaccccc 



atatatatat aaacacagct tatattatat 14220 
actacatggc caaaacaaga aatgtaattt 14280 
cttgaatatt agtcttaaat cttcaaagac 14340 
tgtgattgtt ctacactagt ccttacgtgt 14400 
cataaattgt accataagga ttcttgggga 14460 
tacactttca agtcgttttt tcaatgcatt 14520 
acacgcgcgc acacaaaaac tagtagacaa 14580 
gtaactacat atacatgtgt tttttttttc 14640 
gagtttcatt gaatttatat ttacacaaac 14700 
attaacctga tgttaggatt ataatttcat 14760 
ttattgtaaa gtgtttattt tattaaaatt 14820 
accgttcaaa agagagacga caaggataat 14880 
cctagttagt ttagctatga caaattaagc 14940 
aatgaaaaca tataacaaat tgtgattgaa 15000 
ttaagataag cttcaactcc taagaagaga 15060 
atttataacg gtaaattact catcctttta 1512 0 
taaagggaaa atcaagaagt ctcttcacgg 15180 
cttcatggct agcttttact ttgattataa 15240 
tatttgaaac caactaaagt acaaataatt 15300 
aattaaataa accaatgcat attaaggcgt 15360 
ttgtatatca tagtattaat aaggtacttt 15420 
ggggtgagtt gttttactat ctcttactca 15480 
gattttcagc ccaaattaaa agaagtttta 15540 
atttctaatt tcagaaatca ataattgcgg 15600 
acccaatctt tttctcaatt ctctgaatta 15660 
tcacacgcct tcccacggta gagaaatata 15720 
ttcagaaaat ctcatccatt gcatttgcac 15780 
ttaataaatc tcacacttaa aaaccctcct 15840 
tccacacaaa aaatatcctt aaacaaagag 15900 
aaatttaagg a 15941 
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<210> 32 

<211> 15172 

<212> DNA 

<213> Glycine max 



<400> 32 
aaaagtctga 


atgaagtgaa 


gttgttgtgt tagggggatg 


ctgaatttga ttttaattta 


60 


atfctaaataa 


attacctgtg 


taaaggattt ttgtaccgcc 


ttttaatcac aaattgtcgt 


120 


Qtatcaaaacr 


tttottaaat 


tttatagtaa ctatcttgga ggttatacct agtaggattt 


180 


ct^attagtt 


aaccatotaa 


atttttttaa tgggtttgtg tttgttttaa ctgatgttgt 


240 


ttttaa.ta.tcr 


t a cert crcrt a a 


gtgtgatgag tttgtgcata tcttgatcaa agactgaagt 


300 


oaacrt totter 


t cr 1 1 1 efcrer a a 

L y u uL *yyyyy 


gttgactctt ttaggaagag 


gatttgaggg aatttcgeca 


360 




a era a ft ccacr 


gacaccctga gaattttgag 


actcegggaa ttgaagttac 


420 


aacaaattta 


at atcracrt t a 
Qua ^-y y i— 1_ o. 


gagtttgtat aatagaatag 


taattctttt tattttttat 


480 


ttttattatt 


tat t tat rra 

wen. i_- a. i_ a. 


ttattgetat ttttgtaggt 


cctcttggtc aggggattgc 


540 


caatactatt 


oat - tt era err* 


ttgcagagaa gcacttggct 


gcaagataca acaagectga 


600 


caatoaoatt 


ottaaccatt 


acaegtaaaa ataactaaat 


catacacaac ttgeaagact 


660 


ttagttgttg 


atottttoct 


aatggcgttt gttttatgea 


ggtatgetat tctgggtgat 


720 


ggttgtcaaa 


t aaaaaaaa t 


tgcaaatgaa gcatgctcac 


ttgctggcca ctggggactg 


780 


ggoaaoctaa 


tagcatttta 


tgatgacaat catatttcta 


ttgatggtaa cacagagatt 


840 


gcgttcaccg 


agagtgttga 


tagtcgtttt gagggacttg ggtggcatgt tatttgggta 


900 


aagaatggaa 


ataatggcta 


tgatgatatt cgtgctgcca 


ttaaggaagc aaaagctgtc 


960 


aaagacaaac 


ccacattaat 


caaggttagt ttactaattg 


tggcaatgat atgeatagat 


1020 


ttctggcttg 


tagttaccat 


aacaatcata atctgtcatg 


atttataggg attcttttaa 


1080 


actaaggtta 


taaacatgat 


atgttactaa agactgeatt 


gcattgtagt acctcatttg 


1140 


agaaactgga 


agttataggt 


gggaaatcaa aatagtaaat 


tatgggtcag tttattaagt 


1200 


gtagctggtt 


tgtcttttgc 


taactctatc atttcacagg tcacaacaac cattggttat 


1260 


ggttctccta 


acaaggctaa 


ctcctacagt gtgcatggaa gtgcactggg tgccaaagaa 


1320 


gttgatgeca 


ccaggcagaa 


ccttggatgg tcacatgagc 


cattccacgt gectgaggat 


1380 


gtcaaaaagt 


atggatgatg 


ggtttagaga tttctttttt 


cctgtgacac tgctgatgtt 


1440 


tataatgata 


ctgaataaga 


attgcttgtc tgtccccagg cattggagtc gccacacccc 


1500 
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tgagggtgct gcacttgaag ctgagtggaa tgctaagttt gctgagtatg aaaagaaata 1560 

caaggaggaa gctgcagaat tgaaatctat tatcaatggt gaattccctg ctggttggga 1620 

gaaagcactt ccggtgagta aattctaaac tcacaggttt ttcttacatg tttgtaactt 1680 

ttcaaggttt ggattacata ctatccaaag ttgatcatga tttcagtttg atttggtagt 1740 

atggggcttc aaaaagttat tccaaatatt aaaacctggc ttccaatttg atttcagaca 1800 

tacactccag agagcccagc ggatgccacc agaaacctgt ctcaaacaaa ccttaatgcc 1860 

cttgcaaagg ttcttcccgg tctgcttggt ggcagtgcag atcttgcttc ttccaacatg 1920 

accttgctca aaatgttcgg ggacttccaa aaggatactc cagcagagcg taatgttaga 1980 

ttcggtgtta gagaacacgg aatgggagct atctgcaacg gcattgctct tcacagccct 2 040 

ggactgattc catattgtgc aaccttcttt gtattcactg actacatgag aggtgccata 2100 

aggctttctg cgctgtctga ggctggggtt atttatgtca tgacccatga ttcaatagga 2160 

cttggagaag atgggccaac ccaccagcct attgagcacc tagcaagctt ccgggcaatg 2220 

ccaaacattt tgatgcttcg tcctgccgac ggtaacsgaa acagccsgag catacaaagt 2280 

ggccgtgctc aacaggaaag agaccctcca ttcttgccct atccaggcaa aaactgcccc 2340 

agcttcccgg aactthcatt gaaggagttg aaaagggtgg ttacaccatt tcgggacaac 2400 

tccactggca acaagcctga tgtcattttg atcggaactg gttcggaatt ggaaatcgct 2460 

gccaaagctg ctgatgacct aaggaaggaa gggaaggctg ttagagttgt ttcccttgtt 2520 

tcttggggaa ctttttgatg agcaatcaga agcstacaag gagagtgttt tccctgctgc 2580 

tgtttcagcc agagttagca ttgaggcagg atcaacattt gggtgggaga aaattgttgg 2640 

agcaaaggga aaagcaatag gcattgatcg ttttggagct agtgctccag ctggaagaat 2700 

atacaaagaa tttggtatca ctaaggaagc tgttgttgct gcagctaaag agcttatcta 2760 

gaacttttga tttttttttg ccttctggtt ttggttgaga gcattccatg tcatgaataa 2 820 

gaaaaaggtt aaatatcctt cgattgggaa tttgggattc taagtgaaga agcaacatag 2880 

atcaaagaat tttgtgaact tgtaataact atttgacaat ttagttttgc actgccacat 2 940 

ggttaaattc aatgtttgat catatcaacc atagatgcta gtttatacta cttgtataat 3 000 

ttgtttcact tgcagattca ctcttagcat ttgagtgttt ggatggccgt gttttgttca 3 060 

ttaagaaatg ttgacatgga tagtttgttg catggtaaat tggtgattaa taccttttta 3120 

agcttctttt ggcaagttgt taaaaagtgt tttgttttaa tctctagtta atctaaaaat 3180 

gcctacttat tttattttga ctttcaaaac ttgtctatat gccgagtata aaaaaaactt 3240 

gtctatataa gttccttatg gaaaaaggac ctacaaaata atttaaaaat gctgaatttg 3300 

ttaggaagtt tgacaacttt agacattaat aattgttttt ttttcatata tataattgac 3360 
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tttcaaaact tgtctatata ttataaattc gttatggaaa gggcgaccac acgtaaaacg 3420 

taacatgcca atctctccct agtgaactgt tattctcatt cagtttttgt aagttcacct 3480 

tcaaattttc aatttttttt ttgacaaaaa tatctttgat ggaacacgtt aggggtgtaa 3540 

attcaccatt catcatttca ttcattatac acaatagtct ttcataatcc aatgaaaact 3600 

tgtttttgtt gttatttttt tctgaaaaat ctttataaaa taaataaaaa catttttttg 3660 

ttataaattt cagttgaaat acataaccat taattctaca gagtgttaat gttgatgtaa 3720 

aataaaaaaa tattggatat caaacttaat gaacagaaca aaaattacaa aattgtacaa 3780 

tttaaatgat aaaaaaccaa attcacaaaa ataaaaatta aggacgaaat ttgagaaatg 3 840 

ataattaaaa acgattaaaa atataattaa actaaaattt attatgtaat tctttaatat 3 900 

atatatatat atatatatat atatatatat atatatatat ataataaaag ttttacatta 3960 

taattttgaa atgaagtatt tgataaacga acaacccacc cttggggttg gtgagagtga 4020 

aacggagaac agtgtctccc tcacttggtc ttctgagtcg agtcctattc atcaccaacc 4080 

cactcgtcaa aattgttgta tgctatgtaa ccattttgtt ttttaataaa taaaagtgtt 4140 

ttgttctgct atgcactccc catcaaatac catttacttt ttttttgtct tctatcaatt 4200 

tctatttagt tcctagttgc tacaacttcc agctggtgaa aaatagtcaa cagagactac 4260 

aatattctaa taaaatagaa taacaaaatc aaatatattc atttaatatg attttatttt 4320 

tatttaattt aaaacatgaa ttatttgttt cttatttaat taccgtttcg ttcacttctc 43 80 

aattaataaa ataaaaaatt tctacaatta tagaaaaagt ttagttttgt agattgttaa 4440 

cataaagatt tttcctatta actaattaat taaaagtcat cttatatata agttttaaag 4500 

taattatctt atatatgtac ccaaaagttt taaagcaatt attataaaaa agtaaaattt 4560 

tatcatatac aataatctat aattacatga aagtataatt tttaaattat taatacgtta 4620 

aaattatttt tttaaaattt gaatgatcat atagtaattt tattttatac caagtcccaa 4680 
catttcctat tttattccac ttttaccaat aaatttcctc ataaacactt ataagaaaaa 4740 
aagtaagtaa agaataaaat aagcttttct ataaactata aacaaatttg tataaacttt 4800 
cttatattag tttttctgaa aactaatttt aaattataca taaattgaat ttactgataa 4860 
ttttttattt tctttttgta taagtgtttt ttaagaaatt tgttcaaaca tatttttatg 4920 
acatgtcaat gcttgagaga tataaggtga gttaaatcat aaatgattaa agaaaaataa 4980 
aagaagaaaa agataattaa tttaattttt tattaacatt ctaacaaatt aacattatat 5040 
aaataaaaat aatatttcaa taattggatc aataatacta ttgttcttaa acaccaaacc 5100 
aaccttccta aagtcctaat ctaatccaaa agccatcatt cattttcctt ggtaggtaaa 5160 
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gttccaaaac ctttaccaac cttttcactc aattattttg gtgtaagcaa ttcgacatgt 5220 

gttagtgtta gttggcaacc aaaaacccct ttatgtgact cactcaatcc aacaacctct 5280 

cacaccacca acccccccat aactcataag gccaaaacca tttctcacaa aacccttcat 5340. 

tacacacatt attatcatta tcacacacac aaaaaacctc tcatttcaaa gagagagaga 5400 

gatttcacag accaaattgc agaggacgac aaagttcaca acttcaagga aaatcgaaat 5460 

ggcccaagtg agcagagtgc acaatcttgc tcaaagcact caaattttyg gycattcttc 5520 

caayyccaac aaacycaaat cggygaattc ggtttcattg aggccacgcc tttggggksc 5580 

ctcaaaatct cgcatcyygr tgmayaaaam tggaagcytt atgggaaatt ttaatgyggg 5640 

gaagggaaat tccggcrtgt ttaaggtttc tgcatcsgtc gccgccgccg cagagaagcc 5700 

gtcaacgtcg ccggagatcg tgttggaacc catcaaagac ttctcgggta ccatcacatt 5760 

gccagggtcy aagtctctgt ccaatcgaat tttgcttctt gctgctctct ctgaggtaaa 5820 

gtttatttat ttatttattt ctgtatccaa aaatgtaaat tgttagtttg ttaatttttg 5.880 

ttagcagtga gagtcgaact acttggcttc tttcttagtc atccaaccaa ccttatatct 5940 

ccaaaattta tttattggtt ttttttgggg ttatgtggtt aggaatagga gtttgatgcg 6000 

tggagtggat tttgaatatt tgattttttt tttttttgta ttattcagtg aaaatgaagc 6060 

atcttgtccc atgaaagaaa tggacacgaa attaagtggc gtatgattta gaatgatgat 6120 

agaaatgtgt ataggtggtt ttaatgtgta gcaataagca tattcaatat ctggattgat 6180 

ttggacgttt ctgtataaag gagtatgcta gcaatgtgtt aataatgtct tgttaaaagt 6240 

atggatggct aaaataatca taaaaatcga gtgggagtag tatacatatc tacaggaaat 6300 

gtattaggtg aggcatttgg cttctctatt gcgagtaagg aacaagtaat ctcagttaat 6360 

gtgaaaatca atggttgata ttccaataca ttcatgatgt gttatttacg ggaacgcaat 6420 

attgactgtt gattttatct gcagctgaga ttatatacta tatccttcca aagaaatctt 6480 

tgactcttga ttatccaagt atggttgtat taccaatttt agctctagaa gataatccct 6540 

cccccaaaac acaaattaga atggtgctgc aagttccgtg ttactttcat tctattattt 6600 

ttataacttt taattattta atagatgtct tgtttggcat aaactataat ttattctgtt 6660 

tttttttatt cttatttagg gaacaactgt tgtagacaac ttgytgtaya gygaggatat 6720 

tcattacatg cttggtgcat taaggaccct tggactgcgt gtggaagayg acmaaacaac 6780 

caaacaagca attgtkgaag gctgtggggg attgtttccc actaktaarg aatctaaaga 6840 

tgaaatcaat ttattccttg gaaatgctgg tactgcratg cgtcctttga cagcagctgt 6900 

rgttgctgca ggtggaaatg caaggtctgt tttttgtttt gtttgttcat catgatctct 6960 

gaattgttcc tcgtataact aatcacatca gactatgtgt tcttcctcca tcctgttata 702 0 
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atctaaaaat ctcatccaga ttagtcatcc ttctttaaat gaacctttaa ttatatctat 7080 
gtatttattt aacatgtaaa ttagcttgtc aagtcaaagt ctagcatata gatatactga 7140 
ttacactctg aggaatgcac ctgagggtct taatcatgat ctatttcaac cttgccactt 7200 
tcttctttta ttattagatc acctatcatg attactggfct tgagtctcta aatagaccat 7260 
cttgatgttc aaaatatttc agctacgtac ttgatggrgt gccccgaatg agagagaggc 7320 
caattgggga tttggttgct ggtcttaagc arctyggtgc agatgttgat tgctttcttg 73 80 
gcacaaactg tccacctgtt cgtgtaaatg ggaagggagg acttcctggc ggaaaggtat 7440 
ggtttggatt tccattagaa taaggtggac taactttcct ggagcaaaat tctaatttat 7500 
accctgtttc cccctcttta gaataagaca ctaagggtat gtttaggagt tgggttttgg 7560 
cgagaaaggg aagggagggc aatttttttt ctaataaata ttctttaatt tgataaattt 7620 
tttaaacgaa ggaatatgaa gatagattag cataacttaa tgttttaatc ttttatttat 7680 
ttttataaat attatatacc tctatttaaa aacaagatat ttttcctcca ttccctttcc 7740 
ctttaaaacc tcagttccaa atataccgta cttgaattat attttggaag gtgtattggt 7800 
tggagacctt tccttttcag aggttatccc tcacctttat tatagccttt ctactctctt 7860 
aatgagttca ttgtgcattg agtcattgaa cttcctgtga aaagaaattg tttacttttc 7920 
tttatttgtt acctacatcc ttattcctgt tttaaaaaat actaagtttt cttttagtta 7980 
tgccttcccc tccttattct ctccaaagaa aatataatag cgaatgttgg tttatccatt 8040 
gtgaatctta ttatcctatt tcacatgcag gtgaaactgt ctggatcagt tagcagtcaa 8100 
tacytractg ctttgcttat ggcagctcct ttagctcttg gygatgtgga aattgagatt 8160 
gttgataaac tgatttctgt tccatatgtt gaaatgactc tgaagttgat ggagcgtttt 8220 
ggagtttctg tggaacacag tggtaattgg gatargttct tggtccatgg aggtcaaaag 8280 
tacaagtagg tttctatgtt ttagcattac atcacttttt agtatccaaa atgcaatgaa 8340 
atcaaaactc atgttttcta tcaggtctcc tggcaatgct tttgttgaag gtgatgcttc 8400 
aagtgccagt tayttmctag ctggtgcagc arttactggt gggactatca ctgttaatgg 8460 
ctgtggcaca agcagtttac aggtattcat gcttgacact attttctatt atcttttatt 8520 
ttatggtcta tactgtatca ttcctaattc aacccccttc ttttctcttt cccctcactt 8580 
ccaaaaaaat agagcttatc attgtattct taatagctct tgtatgtcct ttttcgtctc 8640 
ttgagcagat tctaccgtga ttatcctttt tagtaaactt tttttttgtg tgtcttttaa 8700 
ataattgata tcatcatatg aattcacaca tcatctctat cccttcctca ttttattata 8760 
tttcattaac agtgataaag ataaaaataa attgtcaagt aaagcaagtt agcatgtcag 8820 

24 

BNSOOCID: <WO 022699SA1 J_> 



WO 02/26995 



PCT/GB01/04131 



ataatgatgc 


ttcaattgca aaattggtga aagcttgcac agtacaccag cttxggugac 


8880 


I 

aatxgtgatg 


tagatagcca taacctccca cgcaacttat tatgttguLL gctuaaactt 


8940 


_4~ 4- 4- 4- ^> « _ 4_ 

cttttgcact 


taaacacgga ttagagagag tgaggatgaa caataataac agacacaaag 


9000 


cttagtttta 


taccacccgg cacccagtgg tgaagaccaa gcagtatgag tgagatcaat 


_? VJ o u 


ggcagcatct 


gattgaagct tcttgtggta tttttggaga tgaaatccga ggcgaattag 




ggcatctggc 


tcaacaatag ccattggttc aaacttgctt ttaataaaat. ugattacaga 


9180 


atcaaattct 


tgaggaagcc ttcaagaact acatcatgat gttccctgtg caaaacaggt 




tcaccgacaa 


aagagaacat ctacaagttc ctttacacat gccaaaaatt cctcaactga 


o-inn 


acaattgcca 


agagtcatgt tgcgaacttc aggttggagt tgacgaaatt tggtgcatgt 


yjou 


atgattctgg 


aaatgagcat gaagtttttt cctaagctca taaaagtgaa tgcagccaag 




aaccctagat 


aggattgaat cggagagagg cttgaagcca agataacagc aactgatctt 


OA q n 


gctgttccca 


tgctgcgtag gactcattga ttctaccaag atcacaatct tctgtggtga 


OC /ift 


gaaaatgagg 


gtgaattgtc aaccaataaa gaaaatggtg aaggctgagt cttgatgata 


y b uu 


ggcttgattt 


gcttgcgctg aagaagaacg ttattgttgt agagtttctc agtgattgta 


Q £T C C\ 

y bbU 


tgagagtaag 


ttacaggtgc gattggagat gacggacttg tggtgggagg agggtaagtg 


y / 


ttaagtggtg 


gaatgcgtat cagaggtggt tgcccagcca aagaagctgg ttgcggtgga 


O »7 Q A 

9780 


ggaggtggtg 


gtgcagtgga agccatgggt tttgatgctt ttgagagaaa acatacgaaa 


y o4u 


ctgaactgta 


aacttttgta ttattgatta gctgatttgg tagtacaaga gaggaatata 


y yuu 


tatacagttt 


taatgtaact acctaaccaa acatgaggaa ctctccatac acagaaagac 


y y b u 


gtatacaaag 


gataacagac aaacagaaac ttagatatac tcatacagag aagaattaca 


1UUZU 


ggatgaatct 


gagaatttta attgggaaac tgaggaaaat agcaagggtt tcagaattca 


i n n q n 


ttttgacgat 


aggcaaatca tggacttaac ttagacctca agggtacgct tgggagtggg 


1U J-rt U 


ggtttttaga 


gggaaaaggg aaggagggca aacttttaat tcctaataaa tattctcttt 


lUzUU 


tttgattttt 


ttttaaataa aacaaataaa cttatagatt agcatacact taatgtccta 


iuzdu 


atcttttatt 


j iii^ » illilti i_i_i_X* _ 4— 4_ 4_ _ 4- 4- 4- 4- 4— 4— 4— — 4— *T+» +- ^ 4" ♦"•+"+" 

tatttgcatg agtttttttc ttttatattc tttctgcttc z gut. txc cut: 


1 U J z u 


cttcagtgtc 


tcttgtcttt ccttaattcc ttatcaatac cttgtttatx ttcgaaaact 


1UJOU 


4- — . 4_ 4- 4- 4- 

taattttcct 


cattttttta acttataggc tatactatag tatgeauaac cattcaaaca 


1 0440 


aatagtagaa 


aatgtttcaa atattcccca gaactgtttc agtgaacaaa gttcaageca 


10500 


gaaaaaagaa 


gtatatcttc ttgtgaagag tagtctcaag tggttaattg tcaaattgta 


10560 


tcattttaga 


atacaaaatt agaagtggtt tgattagttt atacgaacga ttatttaaat 


10620 


agggttgatg 


atgtggttaa ttactgatca tatgtgaaga ctgtgagtcg atgaccagcc 


10680 
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cagtgcagtc attttaaatt ccattatttt ttaatttgtg cttatgttaa actttgtcct 10740 
tctttcaaag ggagatgtaa aatttgctga agttcttgaa aagatgggag ctaaggttac 10800 
atggtcagag aacagtgtca cygttwctgg accrccacra gattyttctg gtcraaaagt 10860 
cttgcraggc attgatgtca atatgaacaa gatgccagat gttgccatga cwcttgcygt 10920 
tgtygcacta tttgctaatg gtciranactrc yatmagagat ggtatggttt agctgtttgt 10980 
ttgtgttcaa gagcttgttt taacaaattg ttttttttaa ttatggttaa tgaaattatt 11040 
ataattacac taatattgaa attggatcac taaccatgtg aaatgcttga atttttagct 11100 
tgaaaacttc ttttgtggac aatttttctt ttatctatgg attgtggggc atctatgttg 11160 
ttcctttaag cttcgtacaa cctggtctgt gggctggtcc agtccttctt actcatgaca 11220 
gtcttttgca gtaaactggt taaacttggg tgacctgttg acccacccag attggcctgt 11280 
tctttttaaa taaaacttgg ctacttataa gatttcaaca tgatacttga atgatcaccc 11340 
ttactctaaa ggccattatc tgttcagtta tgttcttctt cttcttatta tttttaattt 11400 
atattattat ccaacgatct gatctttgaa catgatgctg gacaatcatc aaagctagtt 11460 
tggttacatc atataatacc ttttttttca tatattattt ttcccatagt atatttttgt 1152 0 
gttttttttt catacaatat tattctgaaa aaaacttgtt tcgttttttt cctggcttga 11580 
atgctatttc aatgactggg ccttattatt ttctggttta ggttatacaa ctgtcctcca 11640 
attgactttt gaatgcttca atagttttct tcttaatcat ggaaggcact agagtgcata 11700 
ttgaatatgt tagaacatgg aaggcttaaa tatcttttta gtctttgcaa tttaacgttt 11760 
ttttgctttt agtccttgca aaattataat ttttttttag tcctgactaa ttatgtttgt 11820 
tttgtttttg agggcgagcc ctggtgcagc ggtaaagttg tgccttggtg acttgttggt 11880 
catgggttcg aatccggaaa cagcctcttt gcatatgcaa gggtaaggct gcgtacaata 11940 
tccctccccc ataccttcgc atagcgaaga gcctctgggc aatggggtag aaaaagaaga 12000 
atgctttaga taacactttt tttgttactg ttcaaagcac tatctaaagt gctttgagga 12060 
ctaaacacaa aacaaacata atttgcaagg actgaaaaac aaaaaaataa ttttgcaagg 12120 
acaaaaaaac aaaaaaaaaa aacaaaattg caaggaccaa aaatatattt aaaccaacat 12180 
gaaattatgg tataaaacca ttcatatgtc ttcatctaat agcttaagct tttgggatag 12240 
ttggttaaca atatggtatt agagtcttat gatttttagg tggtctttga gttcaatcct 12300 
tgctgctccc agttctagtt aatttctttt tggtccactt caaataaaaa gttgaattta 12360 
agggtaaggc agctggacct gcgcattaat catgtgtcag gctcaaaggg cttttgtatg 12420 
gg9gg cat 9 t ttggaatata atatagaatc attcatgtgg tttcacctat caaggacaac 12480 
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tctagttttt gtcatacttg tattattaca gtgtctaaac tagacatgat ctttttccgg 12540 
aaggaaattt aacttaaaag gaacttagat gcttcttgtt ctgtgatctt tatttgttta 12600 
ttttttccaa tctttttaac aattgaaagg ttttagttgt atccaatctt tttatacatt 12660 
tggactatct accatatcat ttagcaaact tgtttgctgc tcagattctc ctttggacta 12720 
tctaccatat catttagcaa acctgttcac tgctcagatt ctcatttatt ctttgtttcc 12780 
tctatttttc agtggcaagt tggagagtta aagagactga gaggatgata gcaatctgca 12840 
cagaactcag aaaggtgagt gtcttgctaa ttcctttatg gtttcgatat tactggactt 12900 
tttacatctc actgacctac tctgtcttgg tatttcagct aggagcaaca gttgaagaag 12960 
gtcctgatta ctgtgtgatt actccacctg agaaattgaa tgtcacagct atagacacat 1302 0 
atgatgacca cagaatggcc atggcattct ctcttgctgc ttgtggggat gttccagtaa 13080 
ccatcaagga tcctggttgc accaggaaga catttccyga ctactttgaa gtccttgaga 13140 
ggttmacaaa gcactaagca ttttgtacat tgagtaggaa gagagagaga gataaatact 13200 
cacaaaacag tgagtatatg catggcttgc ttttgtttct taacattccc ttgtaagaac 13260 
ttgagataat ttgtattgtt ggaaagagct attgattctg ttgtatcaaa gtaatttaat 13320 
ttccagtaca gacttggctc tattcctcat atgtggtggt gtacattgca tctttgcatt 13380 
tgatcccatt tctattggga taggaccata cctttttttg tttattattc aagtcaaact 13440 
tagcagataa tatgtggtga ccagaaaata gtctagctag caaatagacc tgtgattgca 13500 
tgtttgaatt caggacttca ggacatttaa ttaatgagtg acaatgttaa aaagctcgcc 13560 
aacaggagtt gacacatctg attgagtctt ctattaacac taactgtaac ttgagtttga 13620 
ttatcagatt cctcctgttg gtaggtaata tgtaaatttc actatatatt taccgaggtt 13680 
ggtggtctta cttgactttg aagaggtgag tgtaatttaa acttttatat caaaaaaaga 13740 
ttgctaaaag cttgagtggc atgcaacaaa tagcaatgaa agaatgacta tggccttggg 138 00 
aaccaaatgg ccacgtacca cttttccacc aacttgagag ccctggtcct ggatgatgcc 13 860 
cctttttttt tttcatttct ttgacaataa agaaagttct tattcttggc aattgcaagt 13920 
cgtttgtgga agtcacatga aattcaccat aggcttcatt gttcaaatcc tttagtcact 13 980 
aactaaaggc tggacctatg caagaatttt ggctactctt ataaatgctg tgcggcaaag 14040 
actacggaaa aatataaatt attgcatatc tagaggtgag gttcattgct cccatccatt 14100 
tttccaagaa ctaaaatata tataaattat taatatgata ttacattgaa aaataatatt 14160 
tttcttttaa atatgtgatc ataagtttaa ttttaagtct attcatacga aaaaaatatt 14220 
tattagaaaa gattaattta ttaaatggat ctcaatattt taagaaatta acttttagct 14280 
ctcaaccaac agatgatttg agttaaaaaa atgtatatat aaactattgc atatctgctg 14340 
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atttttgttt 


gtaacaccaa 


taagtgggca ttttactaaa 


tccatgtgac 


aatgtactga 


14400 


atgttttaga 


atacagtctg 


gtttgataat aaaacgtgca 


taaagtccag 


cttatgttaa 


14460 


ctcttaattc 


atggaaaatg 


atgatatgcg aattgaaaga 


aagaaaacca 


taatgtaagt 


14520 


gcagttttct 


ctctcttttt 


taatattatt caatatcaga 


ttgcacatgg 


gaaggggtat 


14580 


tggctcatgt 


gggatattag 


tttttgtccc ttacctaaaa 


agagagaaag 


agaaaaataa 


14640 


caatttaaaa 


agagtaccac 


cgctaccacc tttttcgctt 


ttaaattttt 


gttctgtttt 


14700 


cactactctt 


tcatgcgttg 


tttgttggct tctgatagtg 


acgaatatga 


tataatgatt 


14760 


ttatgttggg 


aaaaaagtgc 


catgtaaatt ctagagattg 


aaattttagc 


atcaaattat 


14820 


ttattttatc 


accttttaaa 


atttgttgca tgtgaaagaa 


tttgaagctc 


tccaacaagg 


14880 


aaataacact 


tcaatcctta 


gagtcatcct ctagatccca 


aaagaatgtg 


atgtttatac 


14940 


acattagtat 


atatatatat 


atatatatat atatatatat 


atatatatat 


atatatatat 


15000 


atatatatat 


agggtttata 


ttatattaat gagtgatgta 


cgaggacggt 


ggtgacacta 


15060 


tatggccaaa 


acaagtgtaa 


tataattatt tttttttctc 


aagtattact 


ctatccttga 


15120 


atattattag 


tcttaaatct 


tcaagcacct tgtgaacgaa 


gcaatcttgt 


cc 


15172 



<210> 33 

<211> 32 

<212> DNA 

<213> Primer 

<220> 

< 2 2 1 > mi s c_f eature 

<222> (9).. (9) 

<223> n is inosine 

<220> 

<221> misc_f eature 

<222> (15).. (15) 

<223> n is inosine 
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<220> 

<221> misc_feature 

<222> (21).. (21) 

<223> n is inosine 

<220> 

<221> misc_feature 

<222> (27).. (27) 

<223> n is inosine 

<400> 33 

gcacarcgng caagngaraa ngccatngcc at 

<210> 34 

<211> 29 

<212> DNA 

<213> Primer 

<220> 

<221> misc_feature 

<222> (18).. (18) 

<223> n is inosine 

<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> n is inosine 

<220> 

<221> mi sc__f eature 

<222> (27).. (27) 

<223> n is inosine 
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<400> 34 

gcwggaacwg cmatgcgncc rytnacngc 29 

<210> 35 

<211> 15941 

<212> DNA 

<213> Glycine max EPSPS with double mutation 



<400> 35 



aaaaaatacc 


ttttagtaaa 


ataagagcgt 


r agat caat c 


agacgttttc 


tctttgggac 


60 


ttctcattct 


taattgaatt 


gacaataact 


aaagtgaaac 


taaggctaaa 


atcaactcac 


120 


ctagtcaagc 


tcgtccacaa 


aaataggttt 


ttgaaagttt 


accatatcaa 


tttcttacta 


180 


agtaaaaaag 


gatcatttgt 


aaggtccaac 


gccttaaaat 


gatcaccctt 


caaagtaaaa 


240 


gaatcacttg 


tttcacgcat 


aagtaagaac 


tacgtaggtc 


tgatttcctc 


tccaaaaaaa 


300 


ggtacgtagg 


agcaaaagcc 


ccgcttttgt 


cgaccttaaa 


aaataaaaag 


aaacaaaagt 


360 


taaggtaaca 


caatttccac 


aattctagaa 


ataaggttgt 


tgtctttcga 


gacaaacgta 


420 


agaggtgcta 


ataccttcct 


caaacgtaaa 


tacaactccc 


gaacttagaa 


ttttcatttt 


480 


gaccggtttc 


cttcggtttt 


cccgacgttt 


tccacaaata 


aacgttggtg 


gcgactccgc 


540 


crcafc ctfct rr 




sgcgcaccca 


tgagcctcgc 


cctcgctcgc 


ccgcgaaggg 


600 


cacgttgcga 


cactaggtta 


ataagaagtt 


tagtccacat 


atctatggac 


ctcttaaggt 


660 


gattatcaaa 


gtggaagtgg 


tgacctacat 


gttggcccta 


ccagcttatt 


cgtatattta 


720 


cccccatcaa 


ccatgtatct 


cagctgaaac 


atgctatcaa 


tccaaatgtt 


atcagaggaa 


780 


catgagctgc 


atgttcaacc 


agaagtagtt 


ctagtcttaa 


tacaaatgtc 


aaatggcaca 


840 


agtgagattt 


tgaatttctg 


atgttgtaaa 


aatctcagga 


catgaatact 


attgggaagc 


900 


aattattcat 


acttcaccaa 


tccaaactga 


cccaaaattc 


tcaaatcaca 


tgaaagcaaa 


960 


aatgcatata 


acacgaagaa 


taagaagaag 


aggaactaac 


ctggggtttc 


gatgattaaa 


1020 


gcgttgttgt 


tgatgatgaa 


aacgatgatt 


atggagagaa 


attgttgttg 


aatggtgaaa 


1080 


ttgttataga 


aagagaacga 


agatagagaa 


aaagatatat 


agatttttca 


aggctcaaac 


1140 


cctaaaatca 


ccatgagaga 


gaacaaagat 


tgagaaacct 


acaaccacta 


tgagagagaa 


1200 


tgagcagaac 


agaagcgtga 


gatagagaac 


gagagttaag 


gtgcgagagg 


acacgaagaa 


1260 


caa a aggtgt 


gagagaaaga 


acaaaggagc 


ctacggtgtg 


agatgagaga 


atttgaaatt 


1320 



30 



BNSDOCID: <WO 0226995A1_I_> 



WO 02/26995 



PCT/GB01/04131 



cttaccattt aggtggaatt tcaattctac aattttattc tattaaaatt attttaaaaa 1380 

atgatgtcat tttaaattct ttaaaatctc atatccaaca actgaattat gatagaggta 1440 

tttcaaattc acttaaaaaa attatcttat ttaaataccc catccaaaca tagcgaaatg 1500 

ttcatgagaa ggatcaagtg gtttggaaac atagtactaa tggtgtttat acagttcatg 1560 

gaatccttga tagataattt aaaggttgct ggaaattgga tgaaggtgtg gagattaaat 1620 

attcttccaa aaataaagcg ttttatttgg agagtgttgt gtggttgtct cccctgtagg 1680 

caaaagcttc gatgtaaagg agttcaatgt ccaataacct atgctttcta tccctcgatt 1740 

attgaaaatg aatgacacat tttatttggt tgaaatcaag aaataagcat gtggcaagca 1800 

acgggtattt gacaattcat agaacaaaag gtgaatgcag caaaagaatt aatgaactcc 1860 

ttttcgatct acttggatca ctacatggag atattatcaa caaatttgat gttactttat 1920 

ggagcagttg gaattcttgg aatgacaaga tatgaaatga acataccaac cctcctcttg 1980 

tttctgtttc ggtttctatg cagtattttg ttgaatggca aagtgcaagg taatatgctc 2040 

ctcaacatca attaacaaat gttcatgaca tctcttacca gctccaactt ggggacgttt 2100 

gacaaacacc accgtcaagt ttccttaaat gcaacattaa tgttgctcat ttcaaggagg 2160 

agaatagttt tggtgtcggc atgatactcc atcaaggaag attcgtcaaa gctcactcac 2220 

gttttcgaca tgggtcgaca tgggttacct gacccaaagg ctgaggctta ggcttgggtt 2280 

tgcttcaagt attgatctgg gcccagacta ttggtttaca taatatcatt tttgaaaacc 2340 

taacatctaa aactcaaggt tgtttagagg tgcgccattc caaaataaga ttatcctatt 2400 

tgtgcatgaa tgcgaccaac tatctcctgt ttcagcatta taaagtataa acaacaaact 2460 

tctttaatca agggactaaa agatattgga catacaagct aaaagtgata gaatttgaga 2520 

aaacaaatat tgacaacaat attcaagagg acactaaaac ataattctca aatttttttt 2580 

gtttatttaa aataaagtgg ttcattaggt agctcttgaa aaaaaaaagt gattaacatg 2640 

ctagtacatg ttaaaaaaag tgaataatat tctattataa gtaaagagga ataatttatc 2700 

gatgtctcat ttttgagata gttcaccgaa gccaattttt ttactttttt ctcttcaatc 2760 

aattttattt tgttaatagt tgtcattgtg gatggattat aataattttg aaatgattta 2 820 

aaatgagata tatattaata aatttaatta attcataaat atgttactaa atataaattt 2 8 80 

catggtatac tcaggacggt tataacggaa atcaaataaa taataaaata gaataatatt 2940 

aatttgcagt aaaaagaata atattaattt attctgttta cactatgtat aacgaaatta 3 000 

aatatattca tttaatatga ttttattttt ctttaattta aaaacatgaa ttatttgttt 3 060 

cttattttat taccattttg ttcacttctc aattaataaa ataaaaatat attctacaat 3120 
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tatagtaaaa aggtttaatt ttatataatg 
ttaattaaaa attatcctta ttttaaggaa 
aaattaatta tcttatatat gtaccaaaaa 
aatttatcat ataaaataat ttataattaa 
tatttaaaat ttttgaatga taatataata 
ctattttatt ccacttttag caataaattt 
gtaaaaaata aaaaaaagtt tttcaataaa 
aaaataaagt tatacattga taatataaat 
tatatgataa atttattgat tttttaaaat 
tgcaattaga tgataatata aaattatttt 
atatattagt ttttctaaaa actaattttt 
ttatcttctt tttttataag tattttttaa 
tcaataatac tactctctta gacaccaaac 
agccatcatt cattttcctt ggtaggtaaa 
aattgttttg gtgtaagcaa ttcgacatgt 
ttatgtgact caatccaaca accactcaca 
acccttcatt tacacattat catcaccaaa 
agagagagag agacttcaca gaccaaagtg 
aaaattgaaa tggcccaagt gagcagagtg 
ggccattctt ccaactccaa caaactcaaa 
ctttgggggg cctcaaaatc tcgcatcccg 
tttaatgtgg ggaagggaaa ttccggcgtg 
gagaagccgt caacgtcgcc ggagatcgtg 
atcacattgc cagggtccaa gtctctgtcc 
gaggtgaagt ttatttattt atttatttgt 
ttgatgtgta gagtggattt tgaatatttg 
agcatcatgt cccatgaaag aaatggacac 
gatagaaatg tgtgtagggt tttttaatgg 
gatttggacg tttctgtata aaggagtatg 
aatactccta aaaatcaagt gggagtagta 
gcatttggct tctctattgt aaggaacaaa 



ttagcataca 


gatttttctc 


tattaactaa 


3180 


attaaaaatt 


atcatatata 


tataagtttt 


3240 


gttttaaagc 


aattattata 


aaaattaata 


3300 


attttaaatt 


atcaattcat 


taaattaaat 


3360 


attttatcct 


ctactaagtc 


ccaacgtttc 


3420 


tgtcataaac 


acttataaca 


aaaaaagtaa 


3480 


gtataaacta 


atttgtataa 


acttttagaa 


3540 


tttttacata 


attatccgat 


caactcatta 


3600 


aattatctta 


aaataattta 


aacaatgatt 


3660 


acacactaca 


fcgtattaaac 


tcaaactttt 


3720 


aactcaaaaa 


aaatgttact 


tataattttc 


3780 


gaaatttatt 


gaaacatgac 


catcrcttcrcrq 


3840 


aacccttccc 


aa.acta.taat 


ctaatccaaa 


3900 


gttccaagac 


cttcaccaac 


tttttcactc 


3960 


gttagtgtta 


gttggcaacc 


aaaaatccct 


4020 


ccaccaaccc 


ccataaccat 


ttctcacaat 


4080 


aataaataaa 


aaaaacctct 


catttcagag 


4140 


cagagaacaa 


caaagttcac 


aactttaagg 


4200 


cacaatcttg 


ctcaaagcac 


tcaaattttt 


4260 


tcggtgaatt 


cggtttcatt 


qaqqccaccrc 


4320 


atgcataaaa 


atggaagctt 


tatgggaaat 


4380 


tttaaggttt 


ctgcatcggt 


cgccgccgca 


4440 


ttggaaccca 


tcaaagactt 


ctcgggtacc 


4500 


aatcgaattt 


tgcttcttgc 


tgctctctct 


4560 


ttgtttgttg 


ttgggtgtgg 


gaataggagt 


4620 


attttttttt 


gtattattct 


gtgaaaatga 


4680 


gaaattaagt 


ggcttatgat 


gtgaaatgaq 


4740 


gtagcaataa 


gcatattcaa 


tatctggatt 


4800 


ctagcaatgt 


gttaatgtat 


ggcttgctaa 


4860 


tacatatcta 


cagcaaatgt 


attaggtgag 


4920 


taatatcagt 


taatgtgaaa 


atcaatggtt 


4980 
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gatattccaa tacattcatg atgtgttatt tatatgtacc taatattgac tgttgttttt 5040 

ctccgcaatg accaagatta tttattttat cctctaaagt gactaattga gttgcttact 5100 

ttagagaagt tggacccatt aggtgagagc gtggggggaa ctaatcttga atatacaatc 5160 

tgagtcttga ttatccaagt atggttgtat gaacaatgtt agctctagaa gataaaccct 5220 

cccccaaaac acaaattaga atgacatttc aagttccatg tatgtcactt tcattctatt 52 80 

atttttacaa cttttagtta cttaacagat gtcttgttca gcataaatta taatttattc 5340 

tgtttttttt tagggaacaa ctgttgtaga caacttgttg tatagtgagg atattcatta 5400 

catgcttggt gcattaagga cccttggact gcgtgtggaa gatgacaaaa caaccaaaca 5460 

agcaattgtt gaaggctgtg ggggattgtt tcccactagt aaggaatcta aagatgaaat 5520 

caatttattc cttggaaatg ctggtactgc aatgcgtcct . ttgacagcag ctgtggttgc 5580 

tgcaggtgga aatgcaaggt ctgttttttt ttttttgttc agcataatct ttgaattgtt 5640 

cctcgtataa ctaatcacaa cagagtacgt gttcttcttc ctgttataat ctaaaaatct 5700 

catccagatt agtcatcctt tcttcttaaa aggaaccttt aattatcaat gtatttattt 5760 

aatatttaaa ttagcttgtc aaagtctagc atatacatat tttgattata ttctgagaaa 5 820 

tgcacctgag ggtgttcctc atgatctact tcaacctctg ttattattag attttctatc 5 880 

atgattactg gtttgagtct ctaagtagac catcttgatg ttcaaaatat ttcagctacg 5940 

tacttgatgg ggtgccccga atgagagaga ggccaattgg ggatttggtt gctggtctta 6000 

agcaacttgg tgcagatgtt gattgctttc ttggcacaaa ctgtccacct gttcgtgtaa 6060 

atgggaaggg aggacttcct ggcggaaagg tatggtttgg atttcattta gaataaggtg 6120 

gagtaacttt cctggatcaa aattctaatt taagaagcct ccctgttttc ctctctttag 6180 

aataagacta agggtaggtt taggagttgg gttttggaga gaaatggaag ggagagcaat 6240 

ttttttcttc ttctaataaa tattctttaa tttgatacat tttttaagta aaagaatata 6300 

aagatagatt agcataactt aatgttttaa tcttttattt atttttataa atattatata 6360 

cctgtctatt taaaaatcaa atatttgtcc tccattccct ttcccttcaa aacctcagtt 6420 

ccaaatatac cgtagttgaa ttatattttg gaaggcctat tggttggaga cttttccttt 6480 

tcagagatta tccctcacct ttattatagc ctttctattt ttaaacttca tatagacgcc 6540 

attcttgttt taaaaaacac taagttttct tttagttata ccttcccctc cttattctct 6600 

ccaaagtaaa tattgtagca agtgttggtt gatccattgt gaatcttatt atcctattta 6660 

acatgcaggt gaaactgtct ggatcagtta gcagtcaata cttgactgct ttgcttatgg 6720 

cagctccttt agctcttggt gatgtggaaa ttgagattgt tgataaactg atttctgttc 67 80 
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catatgttga aatgactctg aagttgatgg agcgttttgg agtttctgtg gaacacagtg 6840 

gtaattggga taggttcttg gtccatggag gtcaaaagta caagtaggtt tctatgtttt 6900 

agtgttacat cacttttagt atccaaaatg caatgaaatt aaaactcatg tttcctatta 6960 

ggtctcctgg caatgctttt gttgaaggtg atgcttcaag tgccagttat ttactagctg 7020 

gtgcagcaat tactggtggg actatcactg ttaatggctg tggcacaagc agtttacagg 7080 

tattcatgaa tgacactatc ttctattatc ttttatttta tagcttacac tatcattcct 7140 

aattcaaccc ccttcttttc tctttcccct cactcccaaa atagagctta tcattgtatc 7200 

cttaatagct cttttatgtg ctgtttcatc tcttgagcag attctacatg attatccttt 7260 

ttagtgaact tttttgtctt ttaaataatt gatatcatca tttgaacttg cacatcatct 7320 

ctatccttcc tcattttatt atatttcatt aagagtgata aaattacatt gtcaagtaaa 73 80 

gcaagttagc gtatcagata atgatgcttc gattgcagaa ttggtgaaag cttgcacaat 7440 

tcattagctt agtgacaatt gtgatgtaga tagccataat cttccatgca acttattatg 7500 
ctttttgtct aaaacttctt ttgcacttaa acatgaatta gagagttagg atgaacaata 7560 
gtaacagaca caaagcttat ttctagacca cccagtggtg aagacgaagc agcatgagtg 7620 
agattaatgg cagcatctga ttgaagcttc ttttggtatt attggagatg aaatcttgag 7680 
ccaaattagg gcttttgcct cctcaatagc cattggttca aaattgcttt caataaaatg 7740 
attacagaat caaattctta agaaagacct tcaagaacaa catcaagatg tttcctgtgc 7800 
aaaacaggct cacctacaaa agagtgtcta caagcacctt tacacttgca aaaaattcct 7860 
taactgaaca attgccaaga gtcatgttgc aaacttgaga ttggagttga ttagatttgg 7920 
tgtgtgtgag attctggaaa tgagcatgaa gttttttgca aagctcataa aatgaatgca 7980 
gccaagaatc ctagatagga ttgaatttga gagagaggct tgaagtcaag ataaccgcaa 8040 
ctgatcttgc tgttcccatg ctgcatagga ctcgttgatt ctaccaagat aacaatcttc 8100 
tgtggtgaga atatgagtgg gaattgtcag gcaacaaaga aaatgaaggc tatgagcctt 8160 
tgttaaggtg cagagagaaa acagaaaatg aagaacttgt gtattgaagg aattgaaaag 8220 
ccaattacaa ttgtgttgca gagagatatt tatagttctg aagtgtataa ctaactaaac 8280 
agaatgctac caacctagag gcagttgaga attgctccct tattacaata ctctcacagc 8340 
cttgatgata ggcttgattt gctgccaccg aagaagaaca ttattgttgg agagtttttc 8400 
agtgattgca tcagagaaag ttacaagtgt gattggagat gaaggacttg cggtcgtcga 8460 
aggttaagtg ttaagtggtg gaatgtgtat cggaggtggt tgcctagccc aagaagctgg 8520 
ttgcagtgga agcggatgcg gcagaagcca tgggttcaga tcagaactat gttctagata 8580 
ccatattagg gttagctttc aagagaaaac ttatgaaact gaactgtaaa ctgttgtatt 8640 
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attgattagc tgatctagta gtgatggaag cttgcttgcg aagcttctat ggaggctgga 8700 

tcattgagct tcaatgaggt ccttcaatgg tgatttttca ccatggagat gcagcggaag 8760 

ataaatgaag aggtgagagg aggcgtcatc cactaggaaa taaactatgg aagaaggagc 8820 

ttcaccacca agagagtgtc ttggataaga agcttagaga ggaagcttca atggaggaat 8880 

agaaagagag agggggaaca aaaaattgaa ggaggaaaag agggagagaa gttgaacttt 8940 

gaagtgtgtc tcacaagact cattcattaa agttacaaca agtgttacac atgcttctat 9000 

ttatagtcta ggtaacttcc ttgagaagct agaacttaac tacacacacc tctctaataa 9060 

ctaagataac ctccttgaga aatttccttg agaagcttcc ttaagaagat tcctagagaa 9120 

gctaaagctt agttacacac acctctctaa taactaagtt cacctccttg agatgagaag 9180 

ctagagcctt agctacacac acccttataa tagctaaact cactctattc caaaatacat 9240 

gaaaatacaa aaaagttcct actacaaaga ttactcaaaa tgtcctgaaa tacaagacta 93 00 

aaactctata ctactagaat ggtcaaaata caaggtccaa aagaaggaaa aacctattct 9360 

aatatttaca aagagagtgg acccaacctt agtccatggg ctcagaaatc taccctgagg 942 0 

ttcatgggaa tcctagggtc ttctttagta gctctagtcc aattttcttg gagtcttcta 9480 

tccaatatcc ttgggggtag gattgcatca tgtagtacaa gagaggggaa tatatataca 9540 

gttgtaagtt gtaactacct aatcaaaccc aactgactca ccatacacag aaaaacaatg 9600 

tacaaaggat aatagaaaaa ctgaaactga gatatactca ttacatagaa gaattacatg 9660 

atgaatctga aaattttaat tgggaaactg aggacaataa caaggctttc agagttcatt 972 0 

ttgacaatag gtaaattatg gacttcaact tagacctcaa gggtatgctt gggagtgggg 9780 

ttttagagga aaaggggaag gagggaaatt ttttaattct agtaaatatt ctctattttg 9840 

attaattttt aaaataaaac aaataaactt atagattagc attcacttaa tgtcctgatc 9900 

ttttatttgt ttcatgaagt tttttatagc ctttctgttt ctgttaaaag aaattggagg 9960 

gtgttcataa tctatatgcc catatatatg tattgtcctg ttgtctgtgt ttctgttttc 10020 

tttctttagt gtctcgcctt tccttaattc cttatcaaca ccttgtttat ttttgaaaac 10080 

ttaaatgtcc tcattttttt tacctgtact ataactttcc aaacaaatag tagaaaatgt 10140 

ttcaaatatt ccccagaact gtttcagtga acaaagttca agccagaaaa aaagcaagca 10200 

gaggctaaag aagtatatct tattgtgaag agtaatctca attggttaat tatcgaattg 10260 

tatcatttta caatagaaaa ttagaagtgg gttgattagt ttatacgaac aattatttaa 10320 

atagggttga tggtgtggtt aattactgat catatgttaa gactatgagt cgatgaccag 103 80 

cctcagtgca gtcattttaa gttccattat tttttaattg tgcttatgtt taactttgtc 10440 
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cttcttttaa 


agggagatgt 


aaaatttgct 


gaagttcttg aaaagatggg agctaaggtt 


10500 


acatggtcag 


agaacagtgt 


cactgtttct 


ggaccaccac gagatttttc tggtcgaaaa 


10560 


gtcttgcgag 


gcattgatgt 


caatatgaac 


aagatgccag atgttgccat gacacttgct 


10620 


gttgttgcac 


tatttgctaa 


tggtcccact 


gctataagag 


atggtatggt ttagctgttt 


10680 


gtttttgctc 


aagagcttgt 


tttcacatat 


ttgtggtcaa gagcggttta gctgtttttt 


10740 


tttttaaatt 


gtggttaatg 


aaattattat 


aattatacta 


atattgaaat tggatcacta 


10800 


accatgtgaa 


atgcttgaat 


ttttagcttg 


aaaacttctt 


ttgtggatga tttttctttt 


10860 


atctatgtag 


tgtggggcat 


ctatgttgtt 


cctttaagct 


ttgtgtcata gttgtacaac 


10920 


ctggtctgtg 


ggctggtcca 


gttcttactc 


atgactgtct 


tttgcagcaa actggttaaa 


10980 


cttgggtgac 


ctgttgaccc 


acccagattg 


gcctgttctt 


ttttaaataa aactccaaaa 


11040 


ggctacttat 


aagatttcaa 


catgatactt 


gaatgaccac 


ccttacttta aaggtcatta 


11100 


tgccatcaga 


ccccttgatt 


ctctgttcag 


ttatgttctt 


attcttatta ttttttaatt 


11160 


tatattatta 


tccaacgatc 


tgatctttgt 


acatgatgct 


ggacaatcat caaagctagt 


11220 


ttggttacat 


catataatac 


cttttttttc 


ataaattatt 


ttttcccata atatatttat 


11280 


atgtattttt 


ttaatacaat 


attctgggga 


aatttttttt 


ttctgttttc ttttttctgg 


11340 


tttgaatact 


atttcaatga 


ctgggcctta 


ttattttctg gtttgggtta tacaactgtc 


11400 


ctttgatgta 


cttttgaatg 


cttcaatagt 


atttttcata 


atcatggaag gcactagagt 


11460 


gcagtgcata 


ttgcacaagg 


ttctttaatg 


gcatcttgaa 


tttgttagaa catggagtta 


11520 


tgatatacaa 


ccattcctat 


gtcttcatct 


aatagctaaa gcttttggcg tagttggtta 


11580 


acatggtatt 


ggaatcttat 


tttaaaggtc 


catcttattc 


gcaagtacaa ggcaggtgga 


11640 


cctgcgcatt 


attcacgtgc 


caggctcaaa 


gggattttgt 


at 9 a 99ggg a atttttggaa 


11700 


tataatatat 


agaatcattc 


atgtggttca 


cctatcatgt 


gtcaactcaa gtttttgtcc 


11760 


tagttgtatt 


attgcagtgt 


ctaaactagg 


catgatcttc 


tgtggaagga aatttaacct 


11820 


aaggaacttg 


aatgcttctt 


gttctgagat 


ctttattttt 


tttctccaat ctttttaata 


11880 


cttgaaaggt 


tttagttgta 


tctataaagg 


gtgcagaatt 


agggggtcag cttttctgtt 


11940 


gtaagcgtga 


attaagtgca 


ttttatgttg 


ttccctattt 


tatacatttg aactatcaac 


12000 


catatcattt 


agcaaacttc 


tttgttgctc 


agattcccat 


ttattcttgt ttcctctatt 


12060 


ttttagtggc 


aacrttaqaqa 


crfc taaaaacra 


ctgagaggat 


gatagcaatc tgcacagaac 




tcagaaaggt 


cttgctaatt 


cctttatggt 


ttctatatta 


ctggactttt tacatctcac 


12180 


tgacctacta 


ctgtcttggt 


atttcagcta 


ggagcaacag ttgaagaagg tcctgattac 


12240 


tgtgtgatta 


ctccacctga 


gaaattgaat 


gtcacagcta tagacacata tgatgaccac 


12300 
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agaatggcca 


tggcattctc tcttgctgct tgtggggatg ttccagtaac catcaaggat 


12360 


cctggttgca 


ccaggaagac atttcctgac tactttgaag tccttgagag gttaacaaag 


12420 


cactaagcat 


tttgtacatt gagtagggaa gagagagaga tatatataaa tactcacaaa 


12480 


gcagtgagta 


tatggcttgc ttttgtttct aaacattcct ttataaggac ttgagataat 


12540 


ttgtattgtt 


taaaagagcc attggttctg ttgtatcaaa gtaatttaat tttcagtacc 


12600 


gacttggctc 


tactcatatg tggtggtgta cattgcatct ttgcatttga ttccatatct 


12660 


attgggatgg 


gaccatacat tttttgttta ttatttaagt caaacatatc agataatatg 


12720 


tggtgaccag 


aaaagtgtct agcaatcaaa tgggtctgtg attgcatgtt tgagttcagg 


12780 


acatttatta 


atgagtgaca gaattttaaa aagctcgcca ataggagttg ccacatctga 


12840 


tcgagtcttc 


tgtaaacact aactgtaact tgagtttgat aatcagattc cccccgttgg 


12900 


taggtaatat 


gtaaatttca ctatagattc ttactgagga tgatggtctt gacttcgaag 


12960 


tggtgagtgt 


aattctctaa tctaaacttt atatatatat aaaaaaagaa tgttaaaaac 


13020 


ttgagtgaca 


tacagcaaat agcaatgaaa gcatgaccat ggccttgggg ggccaaatgg 


13080 


ccatgtacca 


cttttccacc tacttgagag ccatggtcct gaatgaggtc cttttgtttc 


13140 


atttttttta 


caataaagaa agttcatatt cttggcaata gcaaggcaaa tcatttgtgg 


13200 


aagtcacatg 


aaattcacca taggcttcat tgttcaaatc ctttagtcac taaggcaggg 


13260 


cctatgtaag 


aattttgact actcttgtaa atgctgtgcg gtaaagacta cgggaaaata 


13320 


taaattattg 


catatctaga ggtgaggttc atttctccga tccatttctc caagtaataa 


13380 


aaatatatat 


aaattattaa tatggtatga tatggttatt tgtcctttaa atatgcgatt 


13440 


ataggttcaa 


tctttggtta atctatcaaa tgcatctcaa tattttgaga gaatagtttt 


13500 


tagctcttga 


ctgagagata ctttgagtta aaaaaatgta tttataaact attgtctatt 


13560 


gaatttttat 


tttttttgta acaccacatg tgagtatttt actaaatcca tgttacaatg 


13620 


tactgaatgt 


tttacaatac agtctggttt gataataaaa tatgcacaaa gtccagctta 


13680 


tgtaactctt 


tcaattcatg gaaaatgatg atatgcgaat tgaaagaaag aaaaccataa 


13740 


tgtaagtgta 


tttttttttt ctttctcttc tttaatatta ctcaatatca gattgcacat 


13800 


qqaaaaaqta 


ctggctcatg tgggatatta atttttatcc cttacctaaa aagagagaaa 


13860 


gagaaaataa 


ctatttaaaa agagtaccac caccaccttt ttcgctttta agttttgttc 


13920 


tgttttcact 


actctttcat gcgttgtttg ttggcttccg atagtgacga atatgataat 


13980 


gattttacgt 


tgagaaaaaa aatgtgccat gtaaattcca gagattgaaa ttttagcttc 


14040 


aaattaatat 


ttattttatt atcttttcaa attggttgca tgtgaaagaa tttgaagttc 


14100 
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ttcaacaagg aaataaacac ttcaatgctc 
agaatgtgat gtttacacac attagtatat 
taatgagtga tgcgcgagga cggtggtgac 
aatttttttt tctttaagta ttactctatc 
cttgcgaacg aagcaatctt gtcctgatcc 
acctaagcac atgatacttt gctgccctaa 
gatgcttttg ccttcatttt caagtttgtt 
ctctttcctt tctgggaccc cttgtcttta 
taattagtac tacttgggtt ccattaaatt 
ttggatcaac tgcatataca tatgttttat 
agttattaaa acataatgta gagccttgtc 
gctttgattt tgattgtacg ttttttgttt 
tgtcttggag tgctgtgatc tttagggtgc 
actaatacta gtagattcca cgcattcaac 
atggtaagaa aaggggagga gaaaagtaag 
cttttccatg acgtaatgaa aacattaatt 
tatgtcaaag aaacttagga ttttttctct 
aatgattaat ttgcgttgtt gaattcgttg 
gaaagttata tcaaagtatc atatactaga 
gtaatgcatg caattaatgg gcaaataaat 
ttcctccaaa acctctcaag ttcgcattga 
aaatctaaac catgtttgtt tcatcttgta 
gaatttagtt gttattgacc gttaaattta 
attgctttta ttcaacgcaa attctagtag 
cggtttgatt ctctacaaac ttgaattgag 
tcttaattaa ctctataatc aaacctttta 
agacaatcaa aatttaaact cacacgaatc 
gggagtccct taatagcaaa agcctcaaaa 
attcaacaaa ccacccatta ttatgttccc 
cacccataat taacccaaaa ggaagtttcc 
cctctataaa tctattttaa aaaaaccccc 
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aatccttaga gtcagtcact agatcccaaa 


14160 


atatatatat aaacacagct 


tatattatat 


14220 


actacatggc caaaacaaga aatgtaattt 


14280. 


cttgaatatt agtcttaaat 


cttcaaagac 


14340 


tgtgattgtt ctacactagt 


ccttacgtgt 


14400 


cataaattgt accataagga 


ttcttgggga 


14460 


tacactttca agtcgttttt 


tcaatgcatt 


14520 


acacgcgcgc acacaaaaac 


tagtagacaa 


14580 


gtaactacat atacatgtgt 


tttttttttc 


14640 


gagtttcatt gaatttatat 


ttacacaaac 


14700 


attaacctga tgttaggatt 


ataatttcat 


14760 


ttattgtaaa gtgtttattt 


tattaaaatt 


14820 


accgttcaaa agagagacga caaggataat 


14880 


cctagttagt ttagctatga 


caaattaagc 


14940 


aatgaaaaca tataacaaat 


tgtgattgaa 


15000 


ttaagataag cttcaactcc 


taagaagaga 


15060 


atttataacg gtaaattact 


catcctttta 


15120 


taaagggaaa atcaagaagt 


ctcttcacgg 


15180 


cttcatggct agcttttact 


ttgattataa 


15240 


tatttgaaac caactaaagt 


acaaataatt 


15300 


aattaaataa accaatgcat 


attaaggcgt 


15360 


ttgtatatca tagtattaat 


aaggtacttt 


15420 


ggggtgagtt gttttactat 


ctcttactca 


15480 


gattttcagc ccaaattaaa 


agaagtttta 


15540 


atttctaatt tcagaaatca ataattgcgg 


15600 


acccaatctt tttctcaatt 


ctctgaatta 


15660 


tcacacgcct tcccacggta gagaaatata 


15720 


ttcagaaaat ctcatccatt gcatttgcac 


15780 


ttaataaatc tcacacttaa 


aaaccctcct 


15840 


tccacacaaa aaatatcctt 


aaacaaagag 


15900 


aaatttaagg a 




15941 
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<210> 36 

<211> 15172 

<212> DNA 

<213> Glycine max EPSPS with double mutation 



<400> 36 
aaaagtctga 


atgaagtgaa 


gttgttgtgt tagggggatg ctgaatttga ttttaattta 


60 


atttaaataa 


attacctgtg 


taaaggattt ttgtaccgcc ttttaatcac aaattgtcgt 


120 


gtatcgaaag 


tttgttgaat 


tttatagtaa ctatcttgga ggttatacct agtaggattt 


180 


ctgattagtt 


aaccatgtaa 


atttttttaa tgggtttgtg tttgttttaa ctgatgttgt 


240 


ttttgatatg 


tacgtggtaa 


gtgtgatgag tttgtgcata tcttgatcaa agactgaagt 


300 


gaagttgttg 


tgtttggggg 


gttgactctt ttaggaagag gatttgaggg aatttcgcca 


360 


at gggggagc 


agaactccgg 


gacaccctga gaattttgag actccgggaa ttgaagttac 


420 


aacaggtttg 


atatgagtta 


gagtttgtat aatagaatag taattctttt tattttttat 


480 


ttttattgtt 


tatttatcca 


ttattgctat ttttgtaggt cctcttggtc aggggattgc 


540 


caatgctgtt 


ggtttggccc 


ttgcagagaa gcacttggct gcaagataca acaagcctga 


600 


caatgagatt 


gttgaccatt 


acacgtaaaa ataactaaat catacacaac ttgcaagact 


660 


ttagttgttg 


gtgttttgct 


aatggcgttt gttttatgca ggtatgctat tctgggtgat 


720 


ggttgtcaaa 


tggagggaat 


tgcaaatgaa gcatgctcac ttgctggcca ctggggactg 


780 


gggaagctga 


tagcatttta 


tgatgacaat catatttcta ttgatggtaa cacagagatt 


840 


gcgttcaccg 


agagtgttga 


tagtcgtttt gagggacttg ggtggcatgt tatttgggta 


900 


aagaatggaa 


ataatggcta 


tgatgatatt cgtgctgcca ttaaggaagc aaaagctgtc 


960 


aaagacaaac 


ccacattaat 


caaggttagt ttactaattg tggcaatgat atgcatagat 


1020 


ttctggcttg 


tagttaccat 


aacaatcata atctgtcatg atttataggg attcttttaa 


1080 


actaaggtta 


taaacatgat 


atgttactaa agactgcatt gcattgtagt acctcatttg 


1140 


agaaactgga 


agttataggt 


gggaaatcaa aatagtaaat tatgggtcag tttattaagt 


1200 


gtagctggtt 


tgtcttttgc 


taactctatc 'atttcacagg tcacaacaac cattggttat 


1260 


ggttctccta 


acaaggctaa 


ctcctacagt gtgcatggaa gtgcactggg tgccaaagaa 


1320 


gttgatgcca 


ccaggcagaa 


ccttggatgg tcacatgagc cattccacgt gcctgaggat 


1380 


gtcaaaaagt 


atggatgatg 


ggtttagaga tttctttttt cctgtgacac tgctgatgtt 


1440 
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tataatgata ctgaataaga attgcttgtc tgtccccagg cattggagtc gccacacccc 1500 
tgagggtgct gcacttgaag ctgagtggaa tgctaagttt gctgagtatg aaaagaaata 1560 
caaggaggaa gctgcagaat tgaaatctat tatcaatggt gaattccctg ctggttggga 1620 
gaaagcactt ccggtgagta aattctaaac tcacaggttt ttcttacatg tttgtaactt 1680 
ttcaaggttt ggattacata ctatccaaag ttgatcatga tttcagtttg atttggtagt 1740 
atggggcttc aaaaagttat tccaaatatt aaaacctggc ttccaatttg atttcagaca 1800 
tacactccag agagcccagc ggatgccacc agaaacctgt ctcaaacaaa ccttaatgcc 1860 
cttgcaaagg ttcttcccgg tctgcttggt ggcagtgcag atcttgcttc ttccaacatg 1920 
accttgctca aaatgttcgg ggacttccaa aaggatactc cagcagagcg taatgttaga 1980 
ttcggtgtta gagaacacgg aatgggagct atctgcaacg gcattgctct tcacagccct 2040 
ggactgattc catattgtgc aaccttcttt gtattcactg actacatgag aggtgccata 2100 
aggctttctg cgctgtctga ggctggggtt atttatgtca tgacccatga ttcaatagga 2160 
cttggagaag atgggccaac ccaccagcct attgagcacc tagcaagctt ccgggcaatg 2220 
ccaaacattt tgatgcttcg tcctgccgac ggtaacsgaa acagccsgag catacaaagt 2280 
ggccgtgctc aacaggaaag agaccctcca ttcttgccct atccaggcaa aaactgcccc 2340 
agcttcccgg aactthcatt gaaggagttg aaaagggtgg ttacaccatt tcgggacaac 2400 
tccactggca acaagcctga tgtcattttg atcggaactg gttcggaatt ggaaatcgct 2460 
gccaaagctg ctgatgacct aaggaaggaa gggaaggctg ttagagttgt ttcccttgtt 2520 
tcttggggaa ctttttgatg agcaatcaga agcstacaag gagagtgttt tccctgctgc 2580 
tgtttcagcc agagttagca ttgaggcagg atcaacattt gggtgggaga aaattgttgg 2640 
agcaaaggga aaagcaatag gcattgatcg ttttggagct agtgctccag ctggaagaat 2700 
atacaaagaa tttggtatca ctaaggaagc tgttgttgct gcagctaaag agcttatcta 2760 
gaacttttga tttttttttg ccttctggtt ttggttgaga gcattccatg tcatgaataa 2820 
gaaaaaggtt aaatatcctt cgattgggaa tttgggattc taagtgaaga agcaacatag 2880 
atcaaagaat tttgtgaact tgtaataact atttgacaat ttagttttgc actgccacat 2940 
ggttaaattc aatgtttgat catatcaacc atagatgcta gtttatacta cttgtataat 3000 
ttgtttcact tgcagattca ctcttagcat' ttgagtgttt ggatggccgt gttttgttca 3060 
ttaagaaatg ttgacatgga tagtttgttg catggtaaat tggtgattaa taccttttta 3120 
agcttctttt ggcaagttgt taaaaagtgt tttgttttaa tctctagtta atctaaaaat 3180 
gcctacttat tttattttga ctttcaaaac ttgtctatat gccgagtata aaaaaaactt 3240 
gtctatataa gttccttatg gaaaaaggac ctacaaaata atttaaaaat gctgaatttg 3300 
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ttaggaagtt tgacaacttt agacattaat aattgttttt ttttcatata tataattgac 3360 

tttcaaaact tgtctatata ttataaattc gttatggaaa gggcgaccac acgtaaaacg 3420 

taacatgcca atctctccct agtgaactgt tattctcatt cagtttttgt aagttcacct 3480 

tcaaattttc aatttttttt ttgacaaaaa tatctttgat ggaacacgtt aggggtgtaa 3540 

attcaccatt catcatttca ttcattatac acaatagtct ttcataatcc aatgaaaact 3600 

tgtttttgtt gttatttttt tctgaaaaat ctttataaaa taaataaaaa catttttttg 3660 

ttataaattt cagttgaaat acataaccat taattctaca gagtgttaat gttgatgtaa 3720 

aataaaaaaa tattggatat caaacttaat gaacagaaca aaaattacaa aattgtacaa 3780 

tttaaatgat aaaaaaccaa attcacaaaa ataaaaatta aggacgaaat ttgagaaatg 3 840 

ataattaaaa acgattaaaa atataattaa actaaaattt attatgtaat tctttaatat 3 900 

atatatatat atatatatat atatatatat atatatatat ataataaaag ttttacatta 3 960 

taattttgaa atgaagtatt tgataaacga acaacccacc cttggggttg gtgagagtga 4020 

aacggagaac agtgtctccc tcacttggtc ttctgagtcg agtcctattc atcaccaacc 4080 

cactcgtcaa aattgttgta tgctatgtaa ccattttgtt ttttaataaa taaaagtgtt 4140 

ttgttctgct atgcactccc catcaaatac catttacttt ttttttgtct tctatcaatt 4200 

tctatttagt tcctagttgc tacaacttcc agctggtgaa aaatagtcaa cagagactac 4260 

aatattctaa taaaatagaa taacaaaatc aaatatattc atttaatatg attttatttt 432 0 

tatttaattt aaaacatgaa ttatttgttt cttatttaat taccgtttcg ttcacttctc 4380 

aattaataaa ataaaaaatt tctacaatta tagaaaaagt ttagttttgt agattgttaa 4440 

cataaagatt tttcctatta actaattaat taaaagtcat cttatatata agttttaaag 4500 

taattatctt atatatgtac ccaaaagttt taaagcaatt attataaaaa agtaaaattt 4560 

tatcatatac aataatctat aattacatga aagtataatt tttaaattat taatacgtta 4620 

aaattatttt tttaaaattt gaatgatcat atagtaattt tattttatac caagtcccaa 4680 

catttcctat tttattccac ttttaccaat aaatttcctc ataaacactt ataagaaaaa 4740 

aagtaagtaa agaataaaat aagcttttct ataaactata aacaaatttg tataaacttt 4800 

cttatattag tttttctgaa aactaatttt aaattataca taaattgaat ttactgataa 4860 

ttttttattt tctttttgta taagtgtttt ttaagaaatt tgttcaaaca tatttttatg 4920 

acatgtcaat gcttgagaga tataaggtga gttaaatcat aaatgattaa agaaaaataa 4980 

aagaagaaaa agataattaa tttaattttt tattaacatt ctaacaaatt aacattatat 5040 

aaataaaaat aatatttcaa taattggatc aataatacta ttgttcttaa acaccaaacc 5100 
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aaccttccta 


aagtcctaat 


ctaatccaaa 


agccatcatt 


cattttcctt 


ggtaggtaaa . 


5160 


gttccaaaac 


ctttaccaac 


cttttcactc 


aattattttg 


gtgtaagcaa 


ttcgacatgt 


5220 


gttagtgtta 


gttggcaacc 


aaaaacccct 


ttatgtgact 


cactcaatcc 


aacaacctct 


5280 


cacaccacca 


acccccccat 


aactcataag 


gccaaaacca 


tttctcacaa 


aacccttcat 


5340 


tacacacatt 


attatcatta 


tcacacacac 


aaaaaacctc 


tcatttcaaa 


gagagagaga 


5400 


gatttcacag 


accaaattgc 


agaggacgac 


aaagttcaca 


acttcaagga 


aaatcgaaat 


5460 


ggcccaagtg 


agcagagtgc 


acaatcttgc 


tcaaagcact 


caaattttyg 


gycattcttc 


5520 


caayyccaac 


aaacycaaat 


cggygaattc 


ggtttcattg 


aggccacgcc 


tttggggksc 


5580 


ctcaaaatct 


cgcatcyygr 


tgmayaaaam 


tggaagcytt 


atgggaaatt 


ttaatgyggg 


5640 


gaagggaaat 


tccggcrtgt 


ttaaggtttc 


tgcatcsgtc 


gccgccgccg 


cagagaagcc 


5700 


gtcaacgtcg 


ccggagatcg 


tgttggaacc 


catcaaagac 


ttctcgggta 


ccatcacatt 


5760 


gccagggtcy 


aagtctctgt 


ccaatcgaat 


tttgcttctt 


gctgctctct 


ctgaggtaaa 


5820 


gtttatttat 


ttatttattt 


ctgtatccaa 


aaatgtaaat 


tgttagtttg 


ttaatttttg 


5880 


ttagcagtga 


gagtcgaact 


acttggcttc 


tttcttagtc 


atccaaccaa 


ccttatatct 


5940 


ccaaaattta 


tttattggtt 


ttttttgggg 


ttatgtggtt 


aggaatagga 


gtttgatgcg 


6000 


tggagtggat 


tttgaatatt 


tgattttttt 


tttttttgta 


ttattcagtg 


aaaatgaagc 


6060 


atcttgtccc 


atgaaagaaa 


tggacacgaa 


attaagtggc 


gtatgattta 


gaatgatgat 


6120 


agaaatgtgt 


ataggtggtt 


ttaatgtgta 


gcaataagca 


tattcaatat 


ctggattgat 


6180 


ttggacgttt 


ctgtataaag 


gagtatgcta 


gcaatgtgtt 


aataatgtct 


tgttaaaagt 


6240 


atggatggct 


aaaataatca 


taaaaatcga 


gtgggagtag 


tatacatatc 


tacaggaaat 


6300 


gtattaggtg 


aggcatttgg 


cttctctatt 


gcgagtaagg 


aacaagtaat 


ctcagttaat 


6360 


gtgaaaatca 


atggttgata 


ttccaataca 


ttcatgatgt 


gttatttacg 


ggaacgcaat 


6420 


attgactgtt 


gattttatct 


gcagctgaga 


ttatatacta 


tatccttcca 


aagaaatctt 


6480 


tgactcttga 


ttatccaagt 


atggttgtat 


taccaatttt 


agctctagaa 


gataatccct 


6540 


cccccaaaac 


acaaattaga 


atggtgctgc 


aagttccgtg 


ttactttcat 


tctattattt 


6600 


ttataacttt 


taattattta 


atagatgtct 


tgtttggcat 


aaactataat 


ttattctgtt 


6660 


tttttttatt 


cttatttagg 


gaacaactgt 


tgtagacaac 


ttgytgtaya 


gygaggatat 


6720 


tcattacatg 


cttggtgcat 


taaggaccct 


t aaa c h a ccr i~ 


at crcraacra va 


CLl^UICLCLCLl^CLCt^ 




caaacaagca 


attgtkgaag 


gctgtggggg 


attgtttccc 


actaktaarg 


aatctaaaga 


6840 


tgaaatcaat 


ttattccttg 


gaaatgctgg 


tactgcratg 


cgtcctttga 


cagcagctgt 


6900 


rgttgctgca 


ggtggaaatg 


caaggtctgt 


tttttgtttt 


gtttgttcat 


catgatctct 


6960 
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gaattgttcc tcgtataact aatcacatca gactatgtgt tcttcctcca tcctgttata 7020 

atctaaaaat ctcatccaga ttagtcatcc ttctttaaat gaacctttaa ttatatctat 7080 

gtatttattt aacatgtaaa ttagcttgtc aagtcaaagt ctagcatata gatatactga 7140 

ttacactctg aggaatgcac ctgagggtct taatcatgat ctatttcaac cttgccactt 7200 

tcttctttta ttattagatc acctatcatg attactggtt tgagtctcta aatagaccat 7260 

cttgatgttc aaaatatttc agctacgtac ttgatggrgt gccccgaatg agagagaggc 7320 

caattgggga tttggttgct ggtcttaagc arctyggtgc agatgttgat tgctttcttg 7380 

gcacaaactg tccacctgtt cgtgtaaatg ggaagggagg acttcctggc ggaaaggtat 7440 

ggtttggatt tccattagaa taaggtggac taactttcct ggagcaaaat tctaatttat 7500 

accctgtttc cccctcttta gaataagaca ctaagggtat gtttaggagt tgggttttgg 7560 

cgagaaaggg aagggagggc aatttttttt ctaataaata ttctttaatt tgataaattt 7620 . 

tttaaacgaa ggaatatgaa gatagattag cataacttaa tgttttaatc ttttatttat 7680 

ttttataaat attatatacc tctatttaaa aacaagatat ttttcctcca ttccctttcc 7740 

ctttaaaacc tcagttccaa atataccgta cttgaattat attttggaag gtgtattggt 7800 

tggagacctt tccttttcag aggttatccc tcacctttat tatagccttt ctactctctt 7860 

aatgagttca ttgtgcattg agtcattgaa cttcctgtga aaagaaattg tttacttttc 7920 

tttatttgtt acctacatcc ttattcctgt tttaaaaaat actaagtttt cttttagtta 7980 

tgccttcccc tccttattct ctccaaagaa aatataatag cgaatgttgg tttatccatt 8040 

gtgaatctta ttatcctatt tcacatgcag gtgaaactgt ctggatcagt tagcagtcaa 8100 

tacytractg ctttgcttat ggcagctcct ttagctcttg gygatgtgga aattgagatt 8160 

gttgataaac tgatttctgt tccatatgtt gaaatgactc tgaagttgat ggagcgtttt 822 0 

ggagtttctg tggaacacag tggtaattgg gatargttct tggtccatgg aggtcaaaag 8280 

tacaagtagg tttctatgtt ttagcattac atcacttttt agtatccaaa atgcaatgaa 8340 
atcaaaactc atgttttcta tcaggtctcc tggcaatgct tttgttgaag gtgatgcttc 8400 
aagtgccagt tayttmctag ctggtgcagc arttactggt gggactatca ctgttaatgg 8460 
ctgtggcaca agcagtttac aggtattcat gcttgacact attttctatt atcttttatt 8520 
ttatggtcta tactgtatca ttcctaattc aacccccttc ttttctcttt cccctcactt 8580 
ccaaaaaaat agagcttatc attgtattct taatagctct tgtatgtcct ttttcgtctc 8640 
ttgagcagat tctaccgtga ttatcctttt tagtaaactt tttttttgtg tgtcttttaa 8700 
ataattgata tcatcatatg aattcacaca tcatctctat cccttcctca ttttattata 8760 
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tttcattaac agtgataaag ataaaaataa attgtcaagt aaagcaagtt agcatgtcag 8820 

ataatgatgc ttcaattgca aaattggtga aagcttgcac agtacattag ctttggtgac 8880 

aattgtgatg tagatagcca taacctccca tgcaacttat tatgttgttt gcttaaactt 8940 

cttttgcact taaacacgga ttagagagag tgaggatgaa caataataac agacacaaag 9000 

cttagtttta taccacccgg cacccagtgg tgaagaccaa gcagtatgag tgagatcaat 9060 

ggcagcatct gattgaagct tcttgtggta tttttggaga tgaaatccga ggcgaattag 9120 

ggcatctggc tcaacaatag ccattggttc aaacttgctt ttaataaaat tgattacaga 9180 

atcaaattct tgaggaagcc ttcaagaact acatcatgat gttccctgtg caaaacaggt 9240 

tcaccgacaa aagagaacat ctacaagttc ctttacacat gccaaaaatt cctcaactga .9300 

acaattgcca agagtcatgt tgcgaacttc aggttggagt tgacgaaatt tggtgcatgt 9360 

atgattctgg aaatgagcat gaagtttttt cctaagctca taaaagtgaa tgcagccaag 9420 

aaccctagat aggattgaat cggagagagg cttgaagcca agataacagc aactgatctt 9480 

gctgttccca tgctgcgtag gactcattga ttctaccaag atcacaatct tctgtggtga 9540 

gaaaatgagg gtgaattgtc aaccaataaa gaaaatggtg aaggctgagt cttgatgata 9600 

ggcttgattt gcttgcgctg aagaagaacg ttattgttgt agagtttctc agtgattgta 9660 

tgagagtaag ttacaggtgc gattggagat ga'cggacttg tggtgggagg agggtaagtg 9720 

ttaagtggtg gaatgcgtat cagaggtggt tgcccagcca aagaagctgg ttgcggtgga 9780 

ggaggtggtg gtgcagtgga agccatgggt tttgatgctt ttgagagaaa acatacgaaa 9840 

ctgaactgta aacttttgta ttattgatta gctgatttgg tagtacaaga gaggaatata 9900 

tatacagttt taatgtaact acctaaccaa acatgaggaa ctctccatac acagaaagac 9960 

gtatacaaag gataacagac aaacagaaac ttagatatac tcatacagag aagaattaca 10020 

ggatgaatct gagaatttta attgggaaac tgaggaaaat agcaagggtt tcagaattca 10080 

ttttgacgat aggcaaatca tggacttaac ttagacctca agggtacgct tgggagtggg 10140 

ggtttttaga gggaaaaggg aaggagggca aacttttaat tcctaataaa tattctcttt 10200 

tttgattttt ttttaaataa aacaaataaa cttatagatt agcatacact taatgtccta 10260 

atcttttatt tatttgcatg agtttttttc ttttatattc tttctgtttc tgttttcttt 10320 

cttcagtgtc tcttgtcttt ccttaattcc ttatcaatac cttgtttatt tttgaaaact 10380 

taattttcct cattttttta acttataggc tatactatag tatgtataac tattcaaata 10440 

aatagtagaa aatgtttcaa atattcccca gaactgtttc agtgaacaaa gttcaagcca 10500 

gaaaaaagaa gtatatcttc ttgtgaagag tagtctcaag tggttaattg tcaaattgta 10560 

tcattttaga atacaaaatt agaagtggtt tgattagttt atacgaacga ttatttaaat 10620 
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agggttgatg atgtggttaa ttactgatca tatgtgaaga ctgtgagtcg atgaccagcc 10680 

cagtgcagtc attttaaatt ccattatttt ttaatttgtg cttatgttaa actttgtcct 10740 

tctttcaaag ggagatgtaa aatttgctga agttcttgaa aagatgggag ctaaggttac 10800 

atggtcagag aacagtgtca cygttwctgg accrccacra gattyttctg gtcraaaagt 10 860 

cttgcraggc attgatgtca atatgaacaa gatgccagat gttgccatga cwcttgcygt 10920 

tgtygcacta tttgctaatg gtcmmactrc yatmagagat ggtatggttt agctgtttgt 10980 

ttgtgttcaa gagcttgttt taacaaattg ttttttttaa ttatggttaa tgaaattatt 11040 

ataattacac taatattgaa attggatcac taaccatgtg aaatgcttga atttttagct 11100 

tgaaaacttc ttttgtggac aatttttctt ttatctatgg attgtggggc atctatgttg 11160 

ttcctttaag cttcgtacaa cctggtctgt gggctggtcc agtccttctt actcatgaca 11220 

gtcttttgca gtaaactggt taaacttggg tgacctgttg acccacccag attggcctgt 11280 

tctttttaaa taaaacttgg ctacttataa gatttcaaca tgatacttga atgatcaccc 11340 

ttactctaaa ggccattatc tgttcagtta tgttcttctt cttcttatta tttttaattt 11400 

atattattat ccaacgatct gatctttgaa catgatgctg gacaatcatc aaagctagtt 11460 

tggttacatc atataatacc ttttttttca tatattattt ttcccatagt atatttttgt 11520 

gttttttttt catacaatat tattctgaaa aaaacttgtt tcgttttttt cctggcttga 11580 

atgctatttc aatgactggg ccttattatt ttctggttta ggttatacaa ctgtcctcca 11640 

attgactttt gaatgcttca atagttttct tcttaatcat ggaaggcact agagtgcata 11700 

ttgaatatgt tagaacatgg aaggcttaaa tatcttttta gtctttgcaa tttaacgttt 11760 

ttttgctttt agtccttgca aaattataat ttttttttag tcctgactaa ttatgtttgt 11820 

tttgtttttg agggcgagcc ctggtgcagc ggtaaagttg tgccttggtg acttgttggt 11880 

catgggttcg aatccggaaa cagcctcttt gcatatgcaa gggtaaggct gcgtacaata 11940 

tccctccccc ataccttcgc atagcgaaga gcctctgggc aatggggtag aaaaagaaga 12000 

atgctttaga taacactttt tttgttactg ttcaaagcac tatctaaagt gctttgagga 12060 

ctaaacacaa aacaaacata atttgcaagg actgaaaaac aaaaaaataa ttttgcaagg 1212 0 
acaaaaaaac aaaaaaaaaa aacaaaattg caaggaccaa aaatatattt aaaccaacat 12180 
gaaattatgg tataaaacca ttcatatgtc ttcatctaat agcttaagct tttgggatag 12240 
ttggttaaca atatggtatt agagtcttat gatttttagg tggtctttga gttcaatcct 12300 
tgctgctccc agttctagtt aatttctttt tggtccactt caaataaaaa gttgaattta 12360 
agggtaaggc agctggacct gcgcattaat catgtgtcag gctcaaaggg cttttgtatg 1242 0 
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gggggcatgt ttggaatata atatagaatc 
tctagttttt gtcatacttg tattattaca 
aaggaaattt aacttaaaag gaacttagat 
ttttttccaa tctttttaac aattgaaagg 
tggactatct accatatcat ttagcaaact 
tctaccatat catttagcaa acctgttcac 
tctattfcttc agtggcaagt tggagagtta 
cagaactcag aaaggtgagt gtcttgctaa 
tttacatctc actgacctac tctgtcttgg 
gtcctgatta ctgtgtgatt actccacctg 
atgatgacca cagaatggcc atggcattct 
ccatcaagga tcctggttgc accaggaaga 
ggttmacaaa gcactaagca ttttgtacat 
cacaaaacag tgagtatatg catggcttgc 
ttgagataat ttgtattgtt ggaaagagct 
ttccagtaca gacttggctc tattcctcat 
tgatcccatt tctattggga taggaccata 
tagcagataa tatgtggtga ccagaaaata 
tgtttgaatt caggacttca ggacatttaa 
aacaggagtt gacacatctg attgagtctt 
ttatcagatt cctcctgttg gtaggtaata 
ggtggtctta cttgactttg aagaggtgag 
ttgctaaaag cttgagtggc atgcaacaaa 
aaccaaatgg ccacgtacca cttttccacc 
cctttttttt tttcatttct ttgacaataa 
cgtttgtgga agtcacatga aattcaccat 
aactaaaggc tggacctatg caagaatttt 
actacggaaa aatataaatt attgcatatc 
tttccaagaa ctaaaatata tataaattat 
tttcttttaa atatgtgatc ataagtttaa 
tattagaaaa gattaattta ttaaatggat 



attcatgtgg tttcacctat caaggacaac 12480 
gtgtctaaac tagacatgat ctttttccgg 12540 
gcttcttgtt ctgtgatctt tatttgttta 12600 
ttttagttgt atccaatctt tttatacatt 12660 
tgtttgctgc tcagattctc ctttggacta 12720 
tgctcagatt ctcatttatt ctttgtttcc 12780 
aagagactga gaggatgata gcaatctgca 12840 
ttcctttatg gtttcgatat tactggactt 12900 
tatttcagct aggagcaaca gttgaagaag 12960 
agaaattgaa tgtcacagct atagacacat 13020 
ctcttgctgc ttgtggggat gttccagtaa 13080 
catttccyga ctactttgaa gtccttgaga 13140 
tgagtaggaa gagagagaga gataaatact 132 00 
ttttgtttct taacattccc ttgtaagaac 13260 
attgattctg ttgtatcaaa gtaatttaat 1332 0 
atgtggtggt gtacattgca tctttgcatt 13380 
cctttttttg tttattattc aagtcaaact 13440 
gtctagctag caaatagacc tgtgattgca 13500 
ttaatgagtg acaatgttaa aaagctcgcc 13560 
ctattaacac taactgtaac ttgagtttga 13620 
tgtaaatttc actatatatt taccgaggtt 13680 
tgtaatttaa acttttatat caaaaaaaga 13740 
tagcaatgaa agaatgacta tggccttggg 13800 
aacttgagag ccctggtcct ggatgatgcc 13860 
agaaagttct tattcttggc aattgcaagt 13920 
aggcttcatt gttcaaatcc tttagtcact 13980 
ggctactctt ataaatgctg tgcggcaaag 14040 
tagaggtgag gttcattgct cccatccatt 14100 
taatatgata ttacattgaa aaataatatt 14160 
ttttaagtct attcatacga aaaaaatatt 14220 
ctcaatattt taagaaatta acttttagct 14280 
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ctcaaccaac agatgatttg agttaaaaaa atgtatatat aaactattgc atatctgctg 14340 

atttttgttt gtaacaccaa taagtgggca ttttactaaa tccatgtgac aatgtactga 14400 

atgttttaga atacagtctg gtttgataat aaaacgtgca taaagtccag cttatgttaa 14460 

ctcttaattc atggaaaatg atgatatgcg aattgaaaga aagaaaacca taatgtaagt 14520 

gcagttttct ctctcttttt taatattatt caatatcaga ttgcacatgg gaaggggtat 14580 

tggctcatgt gggatattag tttttgtccc ttacctaaaa agagagaaag agaaaaataa 14640 

caatttaaaa agagtaccac cgctaccacc tttttcgctt ttaaattttt gttctgtttt 14700 

cactactctt tcatgcgttg tttgttggct tctgatagtg acgaatatga tataatgatt 14760 

ttatgttggg aaaaaagtgc catgtaaatt ctagagattg aaattttagc atcaaattat 14820 

ttattttatc accttttaaa atttgttgca tgtgaaagaa tttgaagctc tccaacaagg 14880 

aaataacact tcaatcctta gagtcatcct ctagatccca aaagaatgtg atgtttatac 14940 

acattagtat atatatatat atatatatat atatatatat atatatatat atatatatat 15000 

atatatatat agggtttata ttatattaat gagtgatgta cgaggacggt ggtgacacta 15060 

tatggccaaa acaagtgtaa tataattatt tttttttctc aagtattact ctatccttga 15120 

atattattag tcttaaatct tcaagcacct tgtgaacgaa gcaatcttgt cc 15172 

<210> 37 
<211> 522 
<212> DNA 
<213> Enhancer 



<400> 37 

aagcttcccg ggccggccgc agctggcttg tggggaccag acaaaaaagg aatggtgcag 60 

aattgttagg cgcacctacc aaaagcatct ttgcctttat tgcaaagata aagcagattc 120 

ctctagtaca agtggggaac aaaataacgt ggaaaagagc tgtcctgaca gcccactcac 180 

taatgcgtat gacgaacgca gtgacgacca caaaactcga gacttttcaa caaagggtaa 240 

tatccggaaa cctcctcgga ttccattgcc cagctatctg tcactttatt gtgaagatag 300 

tggaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag gccatcgttg 360 

aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg agcatcgtgg 420 

aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat atctccactg 480 

acgtaaggga tgacgcacaa tcccactatc cttcttaatt aa 522 



47 



WO 02/26995 PCT/GB01/04131 

<210> 38 

<211> 552 

<212> DNA 

<213> Enhancer 

<400> 38 

aagcttcccg ggccggccgc agctggcttg tggggaccag acaaaaaagg aatggtgcag 60 

aattgttagg cgcacctacc aaaagcatct ttgcctttat tgcaaagata aagcagattc 120 

ctctagtaca agtggggaac aaaataacgt ggaaaagagc tgtcctgaca gcccactcac 180 

taatgcgtat gacgaacgca gtgacgacca caaaactcga gaacatggtg gagcacgaca 240 

cacttgtcta ctccaagaat atcaaagata cagtctcaga agaccagagg gctattgaga 300 

cttttcaaca aagggtaata tcgggaaacc tcctcggatt ccattgccca gctatctgtc 360 

acttcatcga aaggacagta gaaaaggaag atggcttcta caaatgccat cattgcgata 420 

aaggaaaggc tatcgttcaa gatgcctcta ccgacagtgg tcccaaagat ggacccccac 480 

ccacgaggaa catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 540 

gatgttaatt aa 552 

<210> 39 

<211> 28 

<212> DNA 

<213> Primer 



<400> 39 

ctgcagggat cctggttgca ccaggaag 28 

<210> 40 

<211> 29 

<212> DNA 

<213> Primer 



<400> 40 

gaattcccgg gtatggtgaa tttcatgtg 29 
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<210> 41 

<211> 25 

<212> DNA 

<213> Primer 



<400> 41 25 
gacttctcgg gtaccatcac attgc 



<210> 42 

<211> 55 

<212> DNA 

<213> Primer 



<400> 42 

cacctgcagc aaccacagct gctgtcaaag aacgcattgc aataccagca tttcc 

<210> 43 

<211> 27 

<212> DNA 

<213> Primer 



<400> 43 

aatttaatta attcataaat atgttac A 1 



<210> 44 

<211> 73 

<212> DNA 

<213> Primer 



<400> 44 cn 
ggtcttctcc ttcttagttc tcaactactt gttcttctgc tgccagattt caattttcct 60 

73 

taaagttgtg aac 



<210> 45 



49 



WO 02/26995 



PCT7GB01/04131 



<211> 72 



<212> DNA 



<213> Primer 



<400> 45 

atctggcagc agaagaacaa gtagttgaga actaagaagg agaagaccat ggcccaagtg 



60 



agcagagtgc ac 



72 



<210> 46 

<211> 24 

<212> DNA 

<213> Primer 

<400> 46 

ggcaatgtga tggatcccga gaag 24 

<210> 47 

<211> 27 

<212> DNA 

<213> Primer 

<400> 47 

gcgccatggc ggcatcggtt tccaggg 27 

<210> 48 

<211> 33 

<212> DNA 

<213> Primer 



<400> 48 

gcgggtacct cagacaagca cctgagctgc age 



33 



<210> 49 



<211> 25 



50 
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<212> DNA 

<213> Primer 

<400> 49 

gcggcatgcc caacaatggc tcgtc 

<210> 50 

<211> 28 

<212> DNA 

<213> Primer 

<400> 50 

gcggatcctt atttcttcag ttcagcca 

<210> 51 

<211> 9 

<212> PRT 

<213> Motif 

<220> 

<221> MIS C_F EATURE 
<222> (5).. (5) 

<223> X is any amino acid but not S or 

<400> 51 

Ala Leu Leu Met Xaa Ala Pro Leu Ala 
1 5 

<210> 52 
<211> 8 

<212> PRT 

<213> Motif 
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28 
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<220> 

<221> MIS COFEATURE 

<222> (5) . . (5) 

<223> X is any amino acid but not V 

<400> 52 

Glu He Glu He Xaa Asp Lys Leu 
1 5 

<210> 53 

<211> 7 

<212> PRT 

<213> Motif 

<220> 

<221> MIS COFEATURE 

<222> (3).. (3) 

<223> X is V or I 

<220> 

<221> MIS C_FEATURE 

<222> (4) . . (4) 

<223> X is K or S 

<220> 

<221> MIS COFEATURE 

<222> (5) . . (5) 

<223> X is any amino acid but not V 

<400> 53 

Phe Gly Xaa Xaa Xaa Glu His 
1 5 
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<210> 54 

<211> 7 

<212> PRT 

<213> Motif 

<220> 

<221> MISC_FEATURE 

<222> (3).. (3) 

<223> X is K orR 

<220> 

<221> MISC_FEATURE 

<222> (4).. (4) 

<223> X is any amino acid but not R 

<400> 54 

Ala Leu Xaa Xaa Leu Gly Leu 
1 5 

<210> 55 

<211> 15 

<212> PRT 

<213> Motif 

<220> 

<221> MISCJFEATURE 

<222> (3).. (3) 

<223> X is any amino acid 

<220> 

<221> MIS COFEATURE 

53 
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<222> (6) . . (6) 

<223> X is any amino acid but not T (and/or the amino acid at position 
9 is not E) 

<220> 

<221> MIS C_F EATURE 

<222> (8).. (8) 

<223> X is any amino acid 

<220> 

<221> MIS C_F EATURE 

<222> (9).. (9) 

<223> X is any amino acid but not E (and/ or the aminoacidat position 6 
is any amino acid but not T) 

<220> 

<221> MI S C_FEATURE 

<222> (10).. (12) 

<223> X is any amino acid 

<220> 

<221> MIS C_FEATURE 

<222> (14) . . (14) 

<223> X is I or V 

<400> 55 

Gly Leu Xaa Val Glu Xaa Asp Xaa Xaa Xaa Xaa Xaa Ala Xaa Val 
15 10 15 

<210> 56 

<211> 9 

<212> PRT 
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<213> Motif 
<220> 

<221> MIS C_FEATURE 

<222> (5).. (5) 

<223> X is any amino acid but not K (and/or the amino acid at position 
9 is any amino acid but not T) 

<220> 

<221> MI SC_FEATURE 

<222> (7).. (7) 

<223> X is L or V 

<220> 

<221> MIS COFEATURE 

<222> (8).. (8) 

<223> X is K or N 

<220> 

<221> MIS C_FEATURE 

<222> (9).. (9) 

<223> X is any amino acid but not T (and/or the amino acid at position 
5 is any amino acid but not K) 

<400> 56 

lie Thr Pro Pro Xaa Lys Xaa Xaa Xaa 
1 5 

<210> 57 

<211> 8 

<212> PRT 

<213> Motif 



55 
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<220> 

<221> MIS COFEATURE 

<222> (3).. (3) 

<223> X is any amino acid but not N or L 
<220> 

<221> MISC_FEATURE 

<222> (4) . . (4) 

<223> X is D or N 

<400> 57 

Thr lie Xaa Xaa Pro Gly Cys Thr 

1 5 

<210> 58 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MIS C_FEATURE 

<222> (1)..(1) 

<223> X is D or N 

<220> 

<221> MIS C_FEATURE 

<222> (4) . . (4) 

<223> X is any amino acid 

<220> 

56 
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<221> MISC_FEATURE 

<222> (7).. (7) 

<22 3> X is any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (8).. (8) 

<223> X is any amino acid but not R 
<220> 

<221> MIS COFEATURE 

<222> (9).. (10) 

<223> X is any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (11).. (11) 

<223> X is K or R 

<400> 58 

Xaa Tyr Phe Xaa Val Leu Xaa Xaa Xaa Xaa Xaa His 
1 5 10 

<210> 59 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MIS COFEATURE 

<222> (1) . . (1) 

<223> X is I or V 

57 
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<220> 

< 2 2 1 > MIS COFEATURE 

<222> (8) . . (8) 

<223> X is I or L or Q 

<220> 

<221> MISC_FEATURE 

<222> (11).. (11) 

<223> X is V or A or T 

<220> 

<221> MISC_FEATURE 

<222> (12).. (12) 

<223> X is S or G 

<400> 59 

Xaa Val Glu Gly Cys Gly Gly Xaa Phe Pro Xaa Xaa 
15 10 

<210> 60 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MISC_FEATURE 
<222> (1)..(1) 

<223> X is I or V 

<220> 

58 



BNSDOCID: <WO 0226995A1 I > 



WO 02/26995 



PCT/GB01/04131 



<221> MIS C_FEATURE 

<222> (8) . . (8) 

<223> X is I or L or Q 

<220> 

<221> MIS C_FEATURE 

<222> (11) . . (11) 

<223> X is V or A or T 

<400> 60 

Xaa Val Val Gly Cys Gly Gly Xaa Phe Pro Xaa Glu 
15 10 

<210> 61 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MI SC__FEATURE 

<222> (1)..(1) 

<223> X is I or V 

<220> 

<221> MIS COFEATURE 

<222> (8).. (8) 

<223> X is I or L or Q 

<220> 

<221> MIS COFEATURE 

<222> (11).. (11) 

<223> X is V or A or T 
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<220> 

<221> MI SC_FEATURE 

<222> (12).. (12) 

<223> X is S or G 

<400> 61 

Xaa Val Val Gly Cys Gly Gly Xaa Phe Pro Xaa Xaa 
1 5 10 

<210> 62 

<211> 12 

<212> PRT 

<213> Motif 



<220> 

<221> MIS C_FEATURE 

<222> (1) . . (1) 

<223> X is I or V 



<220> 

<221> MIS C_FEATURE 

<222> (11) . . (11) 

<223> X is V or A or T 



<220> 

<221> MI SC_FEATURE 

<222> (12) . . (12) 

<223> X is S or G 



<400> 62 



60 
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Xaa Val Val Gly Cys Gly Gly Lys Phe Pro Xaa Xaa 
15 10 

<210> 63 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MI SC_FEATURE 

<222> (1)..(1) 

<223> X is I or V 

<220> 

<221> MISC_FEATURE 

<222> (11).. (11) 

<223> X is V or A or T 

<400> 63 

Xaa Val Glu Gly Cys Gly Gly Lys Phe Pro Xaa Glu 
15 10 

<210> 64 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MISC_FEATURE 

<222> (1) . . (lr) 

<223> X is I or V 
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<220> 

<221> MISC_FEATURE 

<222> (8) . . (8) 

<223> X is I or L or Q 

<220> 

<221> MI S C_FEATURE 

<222> (11) . . (11) 

<223> X is V or A or T 

<400> 64 

Xaa Val Glu Gly Cys Gly Gly Xaa Phe Pro Xaa Glu 
15 10 

<210> 65 

<211> 12 

<212> PRT 

<213> Motif 

<220> 

<221> MIS C_FEATURE 

<222> (1) . . (1) 

<223> X is I or V 

<220> 

<221> MISC__FEATURE 

<222> (11).. (11) 

<223> X is V or A or T 

<220> 

<221> MI SC_FEATURE 

62 



RN5UTnCtI> <WO 0226995A1 I > 



WO 02/26995 PCT/GBO 1/04 131 

<222> (12).. (12) 
<223> X is S or G 

<400> 65 

Xaa Val Glu Gly Cys Gly Gly Lys Phe Pro Xaa Xaa 
15 10 

<210> 66 

<211> 10 

<212> PRT 

<213> Motif 

<400> 66 

Gly Asn Ala Gly Thr Ala Met Arg Pro Leu 
15 10 

<210> 67 

<211> 10 

<212> PRT 

<213> Motif 

<400> 67 

Gly Asn Ala Gly lie Ala Met Arg Ser Leu 
1 5 10 

<210> 68 

<211> 18 

<212> PRT 

<213> Motif 

<220> 

<221> MISC_FEATURE 
<222> (2).. (2) 
<223> X is A or V 
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<400> 68 

Lys Xaa Ala Lys Arg Ala Val Val Val Gly Cys Gly Gly Lys Phe Pro 
15 10 15 

Val Glu 

<210> 69 

<211> 14 

<212> PRT 

<213> Motif 

<400> 69 

Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val 
15 10 

<210> 70 

<211> 14 

<212> PRT 

<213> Motif 



<400> 70 

Gly Asn Ala Gly lie Ala Met 
1 5 



Arg Ser Leu Thr Ala Ala Val 
10 
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